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(54) METHOD OF GENERATING MULTIMEDIA STREAM WHICH ENABLES SELECTIVE 

REPRODUCTION OF VIDEO DATA AND MULTIMEDIA OPTICAL DISK AUTHORING SYSTEM 



(57) The present invention provides an authoring 
system for variously processing a bitstream carrying 
information about moving picture data, audio data, and 
subpicture data constituting titles having a sequence of 
related content, generating a bitstream constituting a 
title with content corresponding to a user selection, effi- 
ciently recording said generated bitstream to a specific 
recording medium, and reproducing title content corre- 
sponding to a user selection from the bitstream thus 
generated. Editing content input by a user using a key- 
board or other means for the multimedia bitstream con- 
tent that is the source data is converted to authoring 
encoding parameters; whether certain conditions deter- 
mined by the data structure and other considerations of 
the authoring system are met is determined; and if the 
conditions are not satisfied, feedback is given to the 
user to prompt re-entry of the editing information. 
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Description 
Technical Field 

The present invention relates to an authoring sys- s 
tern for variously processing a bitstream carrying video 
information representing individual titles having content 
with a particular sequential relationship, audio data, and 
ancillary video data; generating a bitstream whereby a 
title with user-selectable content can be constructed; 
efficiently recording to a particular recording medium 
said generated bitstream; and reproducing title content 
according to a user-selected sequence from said gener- 
ated bitstream. More specifically, the present invention 
relates to a method for generating a multimedia stream 
comprising mutually related audio data and video data, 
and to a multimedia optical disk authoring system for 
storing said multimedia stream as digital data. 

Background Art 

Authoring systems that are part of systems using 
analog video and video CDs for producing titles having 
content with a particular sequential relationship by digit- 
ally processing multimedia data such as video data, 
audio data, and ancillary video data have been devel- 
oped in recent years. MPEG data is one example of a 
multimedia stream comprising such audio information 
and video information. MPEG data recording media 
include Video CD, and an authoring system therefor is 
based on a workstation. A Video CD system is able to 
record video data to a CD medium, which is usually 
used for digital audio recording and has a recording 
capacity of approximately 600 megabytes, by using a 
high efficiency MPEG video compression method. 

In this type of authoring system, elementary 
streams of video and audio information are first 
encoded, and then system stream encoding is applied 
to these elementary streams to generate an MPEG 
stream. A reproduction path, which is a reproduction 
sequence of an MPEG stream, is then determined to 
enable reproduction of content according to a user- 
selected scenario. This scenario information and MPEG 
stream are then multiplexed, converted to a disc image 
of the CD medium, and then recorded to a CD medium 
to create a master disc. This master disc is then dupli- 
cated by means of a pressing process or other method 
to produce discs for distribution. 

More recently, an optical disc recording medium 
known as DVD has been introduced with a greater stor- 
age capacity than Video CD. Video with an extended 
play time can be stored to DVD media, and this capacity 
can be used to achieve such desirable but convention- 
ally unavailable functions as an alternative path play- 
back function for video data. Alternative path playback 
is a function whereby video data from plural related 
paths is divided into specific segments, which are then 
multiplexed on an optical disc. A disc reproduction 



apparatus then reproduces only the video data seg- 
ments selected from among a plurality of multiplexed 
video data segments, skipping those multiplexed seg- 
ments not associated with a selected reproduction path. 
These multiplexed sections are called "alternative video 
reproduction periods." 

An example of an alternative reproduction function 
is the parental lock playback function whereby the play- 
back video is selectively reproduced according to view- 
ing restriction information. More specifically, this 
function enables the selective presentation or non-pres- 
entation of, for example, violent scenes in a movie. A 
further example of an applied alternative reproduction 
function is multiviewpoint playback in which images 
from different viewing angles are selectively repro- 
duced. A specific example of this is a baseball game in 
which the imaging angle of the video playback can be 
selectively changed during a live broadcast to images 
presented from, for example, the batter's viewpoint the 
pitcher's viewpoint, or the viewpoint of a spectator's 
seat. 

Because alternative video reproduction periods are 
skipped during playback, associated video data must 
satisfy a wide range of restrictive conditions relating to 
encoder conditions and video data combinations. How- 
ever, conventional Video CD and other authoring sys- 
tem processes, in principle, generate all MPEG data 
according to the same conditions. As a result, directly 
applying this process will result in the generation of 
defective alternative video reproduction periods as a 
result of video data not satisfying these conditions. 
Discs containing such defective alternative video repro- 
duction periods will cause the disc reproduction appara- 
tus to misoperate during playback 

Furthermore, such defective alternative video 
reproduction periods cannot be detected until the seg- 
ment is played back after the master disc is completed. 
As a result the title producer must re-author the master 
disc starting from the encoding process, imposing a sig- 
nificant burden. The magnitude of this burden is partic- 
ularly severe during MPEG-2 encoding. MPEG-2 
encoding in general provides significantly higher quality 
audio and video streams compared with MPEG-1, and 
the playback time is approximately double. This is 
because the video quality in MPEG-2 is determined by 
such factors as the bit rate and other parameters speci- 
fied during encoding, and by the filters used. A two-pass 
encoding process is therefore used. In this two-pass 
process, the encoded image quality is checked, param- 
eters are readjusted to achieve the image quality that 
can be provided with MPEG-2, and the final image is 
then produced through a second encoding pass. As will 
thus be obvious, a significant amount of work is required 
to generate a multimedia stream containing alternative 
video reproduction periods. 

The compression method and data syntax of 
MPEG-2 are also slightly different from those used in 
MPEG-1. The content of and differences between 



15 



20 



25 



30 



35 



40 



45 



50 



2 



3 



EP0 875 856 A1 



4 



MPEG-1 and MPEG-2 are described in detail in the 
MPEG standards ISO-11172 and ISO-13818, and fur- 
ther description thereof is thus omitted below. While the 
structure of the video encoding stream is defined in 
MPEG-2, a hierarchical structure for the system stream 
and a method for processing low hierarchy levels are 
not clearly defined. 

In consideration of the above problem, an object of 
the present invention is to provide a generating method 
for efficiently generating a multimedia stream containing 
alternative video reproduction periods and an authoring 
system therefor. 

Disclosure of Invention 

An authoring system for generating a bitstream 
constituting a title having content corresponding to a 
user selection by applying a variety of processes to a 
source stream carrying video data, audio data, and 
ancillary video data constituting titles having content 
with a particular sequential relationship, said authoring 
system comprising a means for presenting source 
stream content in editing units, and a means for gener- 
ating edit command data for the presented editing units 
(VOB). 

Brief Description of Drawings 

Fig. 1 is a typical diagram of the data structure of a 
multimedia bitstream. 

Fig. 2 is a block diagram of the structure of an 
authoring encoder comprising the editing information 
generator shown in Fig. 38. 

Fig. 3 is a block diagram of the structure of a DVD 
authoring decoder according to the present invention. 

Fig. 4 is a typical diagram of an exemplary multi- 
rated title stream. 

Fig. 5 is a typical diagram of a VTS data structure, 
which is part of a bitstream according to the present 
invention. 

Fig. 6 is a typical diagram of a detailed data struc- 
ture of a system stream. 

Fig. 7 is a typical diagram of a packet data structure 
of a system stream. 

Fig. 8 is a typical diagram of a data structure of an 
NV packet according to the present invention. 

Fig. 9 is a typical diagram of an exemplary multi- 
scene control scenario in a DVD system according to 
the present invention. 

Fig. 10 is a typical diagram of system stream con- 
nection during multiviewpoint control. 

Fig. 11 is a typical diagram of an exemplary VOB 
during multiscene control. 

Fig. 12 is a block diagram of a DVD authoring 
encoder according to the present invention. 

Fig. 13 is a typical diagram of a VOB set data 
sequence according to the present invention. 

Fig. 14 is a typical diagram of a VOB data stream 



structure according to the present invention. 

Fig. 15 is a typical diagram of an encoding parame- 
ter structure according to the present invention. 

Fig. 16 is a typical diagram of a program chain 
5 structure in DVD multiscene control according to the 
present invention. 

Fig. 17 is a typical diagram of a VOB structure in 
DVD multiscene control according to the present inven- 
tion. 

10 Fig. 18 is a typical diagram of a data structure for 

search information in a navigation pack NV according to 

the present invention. 

Fig. 19 is a typical diagram showing the concept of 

multi-angle control. 
75 Fig. 20 is a flow chart showing a first part of an 

encoding control method according to the present 

invention. 

Fig. 21 is a flow chart showing a second part of the 
encoding control method according to the present 
20 invention shown in Fig. 20. 

Fig. 22 is a flow chart showing the encoding param- 
eter generation process for a non-seamless switching 
stream in multi-angle control. 

Fig. 23 is a flow chart showing the detailed opera- 
25 tion of the common VOB data setting routine shown in 
Fig. 22. 

Fig. 24 is a flow chart showing the encoding param- 
eter generation process for a seamless switching 
stream in multi-angle control: * ~ 

30 Fig. 25 is a flow chart showing the encoding param- 
eter generation process for parental lock control. 

Fig. 26 is a flow chart showing the encoding param- 
eter generation process for a single scene. 

Fig. 27 is a flow chart of the operation of a DVD 
35 encoder according to the present invention shown in 
Fig. 12. 

Fig. 28 is a flow chart of the operation of a formatter 
for non-seamless switching multi-angle control accord- 
ing to the present invention shown in Fig. 12. 
40 Fig. 29 is a flow chart of the operation of a formatter 
for seamless switching multi-angle control according to 
the present invention shown in Fig. 12. 

Fig. 30 is a flow chart of the operation of a formatter 
for parental lock control according to the present inven- 
ts tion shown in Fig. 12. 

Fig. 31 is a flow chart of the operation of a formatter 
for a single scene according to the present invention 
shown in Fig. 12. 

Fig. 32 is a typical diagram showing the structure of 
so a decoding system table according to the present inven- 
tion. 

Fig. 33 is a typical diagram showing the structure of 
a decoding table according to the present invention. 

Fig. 34 is a typical diagram showing the structure of 
55 system stream data. 

Fig. 35 is a typical diagram showing the data struc- 
ture in contiguous blocks. 

Fig. 36 is a typical diagram showing the data struc- 
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ture in interleaved blocks. 

Fig. 37 is a block diagram of the structure of a first 
embodiment of an editing information generator accord- 
ing to the present invention shown in Fig. 12. 

Fig. 38 is a block diagram of the structure of a sec- 
ond embodiment of an editing information generator 
according to the present invention shown in Fig. 12. 

Fig. 39 is a flow chart of the operation of the editing 
information generator shewn in Fig. 37 and Fig. 39. 

Fig. 40 is a block diagram of the structure of a sce- 
nario selector according to the present invention shown 
in Fig. 3. 

fig. 41 is an exemplary diagram of a reproduction 
apparatus for a DVD, which is an optical disc to which is 
recorded a multimedia bitstream generated by a multi- 
media optical disc authoring system according to the 
present invention. 

Fig. 42 is used to describe the video data of an opti- 
cal disc according to the present invention, and the 
reproduction sequence thereof. 

fig. 43 is used to describe the structure of a video 
table VTBL 

fig. 44 is used to describe the structure of an audio 
table ATBL 

Fig. 45 is a block diagram of the structure of a third 
embodiment of an editing information generator accord- 
ing to the present invention shown in Fig. 12. 

fig. 46 is a flow chart showing the verification oper- 
ation for whether an alternative reproduction period is a 
multi-angle control period or a parental lock control 
period. 

Best Mode for Carrying Out the Invention 



(1.1> 


Data structure of an authoring system 


<1.2> 


Authoring encoder EC 


(1.3) 


Scenario data St7 generation 


<1.4> 


Authoring decoder DCD 


<1.5> 


Scenario selection data St51 generation 


(2.1) 


Parental lock control and muftiangle control 


(2.2) 


Segmenting an alternative reproduction 




period for parental lock control 


<2.3) 


An example of limit values of an alternative 




reproduction period for parental lock control 


<2.4> 


Dividing an alternative reproduction period 




for angles 


(2.5) 


Multiscene control 


<3.1) 


Data structure in a DVD system 


(3.2) 


DVD encoder 


(3.3) 


DVD decoder 


(3.3.1) 


Multiscene 


(3.3.2) 


Seamless playback 


(3.3.3) 


Details of seamless playback 


(3.4.1) 


Interleaving 


(3.4.2) 


Definition of interleaving 


(3.4.3) 


Structure of interleave blocks and units 


(3.5.1) 


Multiscene control 


(3.5.2) 


Parental control 



(3.5.3) 


Multi-angle control 


(3.6.1) 


Flew chart: encoder 


(3.6.2) 


Formatter flow chart descriptions 


(3.7) 


Decoder flow charts 


(3.7.1) 


Disk-to-stream buffer transfer 


(3.8) 


DVD player 



(1.1 )Data structure of an authoring system 



10 The logic structure of a multimedia data bitstream 
processed by a recording apparatus, recording medium, 
and reproduction apparatus according to the present 
invention, and by an authoring system comprising the 
function thereof, is described first below with reference 

is to fig. 1 . Video and audio information of which the con- 
tent can be recognized, understood, or enjoyed by a 
user is defined as one title. With reference to a movie, 
for example, a title contains the information used to 
express, in the broadest context, the complete content 

20 of a single movie, and, in the smallest context, the con- 
tent of each individual scene. 

A video title set VTS comprises bitstream data con- 
taining information for a specific number of titles. For 
brevity, a video title set VTS is referred to below as sim- 

25 ply a VTS. A VTS contains reproduction data, such as 
the video and audio representing the actual content of 
each title, and control data for controlling the reproduc- 
tion data. 

A video zone VZ, which is one video data unit in an 
so authoring system, is formed from a specific number of 
VTS. For brevity, a video zone VZ is referred to below as 
simply a VZ. K+1 VTS numbered VTS #0 to VTS #K, 
where K is a positive integer including zero, are 
arranged in linear sequence in one VZ. One of these 
35 VTS, preferably the first VTS #0, is used as a video 
manager for manifesting the title content contained in 
each VTS. 

A multimedia bitstream MBS, which is the largest 
management unit of a bitstream of multimedia data in 
40 an authoring system, is formed from a specific number 
of VZ comprised as described above. 

( 1 .2 ) Authoring encoder EC 

45 A preferred embodiment of an authoring encoder 
EC based on the present invention for generating a new 
multimedia bitstream MBS by encoding an original mul- 
timedia bitstream according to a user-defined scenario 
is shown in fig. 2. Note that an original multimedia bit- 

so stream comprises a video stream St1 for carrying video 
information, a subpicture stream St3 for carrying cap- 
tions and other ancillary video information, and an audio 
stream St5 for carrying audio information. The video 
stream and audio stream are streams containing video 

55 and audio information obtained from a target during a 
specified period. The subpicture stream is a stream 
containing video information for one screen, that is, 
momentary video information. If necessary, the subpic- 
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ture for one screen can be captured to video memory, 
for example, and the captured subpicture screen can be 
continuously displayed. 

During a live broadcast, multimedia source data 
St 1 , St3, and St5 are supplied in real-time as video and 
audio signals from a video camera or other means. 
They may also be non-real-time video and audio signals 
reproduced from video tape or other recording medium. 
It should be noted that three different multimedia source 
streams are shown in the figure for simplicity, and it will 
be obvious that three or more source data streams each 
representing different title content can be input. Multi- 
media source data comprising audio, video, and ancil- 
lary video information for a plurality of titles is referred to 
as a multi-title stream. 

An authoring encoder EC comprises an editing 
information generator 100, encoding system controller 
200, video encoder 300, video stream buffer 400, sub- 
picture encoder 500, subpicture stream buffer 600, 
audio encoder 700, audio stream buffer 800, system 
encoder 900, video zone formatter 1300, recording sec- 
tion 1200, and recording medium M. A bitstream 
encoded by an encoder according to the present inven- 
tion as shown in the figure is recorded to, for example, 
an optical disk medium. 

The authoring encoder EC comprises an editing 
information generator 100 for outputting scenario data 
instructing editing the parts of the multimedia bitstream 
MBS corresponding to the user selections of video, sub- 
picture, and audio information in an original multimedia 
title. An editing information generator 100 preferably 
comprises a display unit, speaker unit, keyboard, CPU, 
and source stream buffer means. The editing informa- 
tion generator 100 is connected to the above-described 
external multimedia stream source from which the mul- 
timedia source data St1, St3, and St5 are supplied. 

A user can reproduce video and audio in the multi- 
media source data on the display unit and speaker unit 
to recognize the title content. While confirming the 
reproduced content, a user can enter content editing 
commands according to a desired scenario using the 
keyboard unit. Note that editing command content 
refers to information describing whether all or part of the 
source data containing content for a plurality of titles is 
selected, describes what source data content is 
selected at specific time intervals, and describes how 
the selected content is connected and reproduced. 

Based on the keyboard input, the CPU generates 
scenario data St7, which encodes such information as 
the location, length, and time-base relationship between 
edit objects in the selected parts of multimedia source 
data St1 , St3, and St5. It should be noted that generat- 
ing the scenario data St7 is described in detail below 
together with the structure and operation of the editing 
information generator 100 while referring to Fig. 37, Fig. 
38. Fig. 39, and Fig. 40. 

The source stream buffer has a specific capacity, 
and outputs after delaying the multimedia source data 



streams St1, St3, and St5 a specific time Td. This delay 
is required because a certain time Td is needed to 
determine the editing process contents of the multime- 
dia source data based on the scenario data St7 when 

5 encoding is done at the same time the scenario data 
St7 is generated, that is, when a sequential encoding 
process is used. It is therefore necessary during the 
actual edit encoding process to delay the multimedia 
source data for time Td for synchronization with edit 

70 encoding. The delay time Td used in this sequential 
editing process only needs to be long enough to resyn- 
chronize the various elements of the system, and the 
source stream buffer therefore normally comprises 
semiconductor memory or other high speed recording 

is medium. 

When the multimedia source data is encoded at 
one time after the scenario data St7 is generated for the 
entire title, that is, when the multimedia source data is 
encoded during batch encoding, the delay time Td must 

20 be at least as long as one title. In this case, the source 
stream buffer can be a low speed, high capacity storage 
medium such as video tape, magnetic disk, or optical 
disk. More specifically, the source stream buffer can be 
constructed using an appropriate storage medium 

25 according to the delay time Td and manufacturing cost. 
The encoding system controller 200 is connected to 
the editing information generator 100, and receives the 
scenario data St 7 from the editing information generator 
100. The encoding system controller 200 generates the 

30 encoding parameter data, and the encoding starting 
and stop timing signals St9, St11, and St 13 for edit 
objects in the multimedia source data based on informa- 
tion relating to the time-base position and length of edit 
content contained in the scenario data St7. The encod- 

35 ing system controller 200 also encodes the scenario 
data St7 to edit control command data St7 at the small- 
est editing unit level of the bitstream that is valid for the 
authoring encoder EC, and feeds the edit control com- 
mand data St7' back to the editing information generator 

40 100. Note the process for generating this edit control 
command data St7 is described below with reference to 
Fig. 37. 

As thus described, the multimedia source data St1 , 
St3, and St5 are synchronized with the timing signals 

45 St9, St1 1. and St13 because St1, St3, and St5 are out- 
put after being delayed time Td by the source stream 
buffer. More specifically, signal St9 is a video encoding 
signal, and thus specifies the encoding timing of the 
video stream St1 ; the video encoding signal St9 is thus 

50 used to extract the part of video stream St1 to be 
encoded, and generate the video encoding unit. Signal 
St1 1 is likewise a subpicture stream encoding signal for 
declaring the timing at which the subpicture stream St3 
is encoded to generate a subpicture encoding unit. In 

55 addition, signal St13 is an audio encoding signal for 
declaring the timing at which the audio stream St5 is 
encoded to generate an audio encoding unit. 

The encoding system controller 200 also generates 
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timing signals St21 , St23, and St25 for arranging the 
encoded multimedia encoding streams in a particular 
mutual relationship based on the streams St1 , St3, and 
St5 of the multimedia source data contained in the sce- 
nario data St7. 5 

The encoding system controller 200 also generates 
stream encoding data St33 for a title editing unit (VOB) 
of each title in one video zone VZ. The stream encoding 
data St33 indicates encoding parameters for system 
encoding, that is. multiplexing the video, audio, and sub- w 
picture multimedia encoding streams, and the reproduc- 
tion time information IT, which is indicative of the 
reproduction presentation time of a title editing unit 
(VOB). 

The encoding system controller 200 furthermore is 
generates an formatting command signal St39 defining 
the formatting parameters for formatting the title editing 
units (VOB) as a multimedia bitstream MBS based on 
the title editing units (VOB) of each stream where the 
title editing units (VOB) are in a specific mutual time- 20 
base relationship. The formatting command signal St39 
is used for generating connections between the title 
editing units (VOB) of each title in a multimedia bit- 
stream MBS, or interleaved title editing units (VOBs) for 
interleaving the title editing units. 25 

The video encoder 300 is connected to the source 
stream buffer of the editing information generator 100 
and to the encoding system controller 200, and receives 
therefrom the video stream St1, encoding parameter 
data for video encoding, and encoding start/stop timing 30 
signals St9. This parameter data includes, for example, 
the encoding start/stop timing, bit rate, encoding condi- 
tions for starting and stopping encoding, and type of 
content information; the type of content information 
includes, for example, whether the signal is formatted 35 
as an NTSC signal, PAL signal, or telecine content 
Based on the video encoding signal St9, the video 
encoder 300 encodes a specific part of the video stream 
St1, and thereby generates the video encoding stream 
St15. 40 

The subpicture encoder 500 is connected to a 
source buffer of the editing information generator 100, 
and to the encoding system controller 200, and receives 
therefrom the subpicture stream St3 and the subpicture 
stream encoding signal St11. Based on the parameter 45 
signal St1 1 for subpicture stream encoding, the subpic- 
ture encoder 500 encodes a specific part of the subpic- 
ture stream St3, and thereby generates the subpicture 
encoding stream St17. 

The audio encoder 700 is also connected to a so 
source buffer of the editing information generator 100, 
and to the encoding system controller 200, and receives 
therefrom the audio stream St5 and audio encoding sig- 
nal St13. Based on the audio encoding parameter data 
and encoding start/stop timing signal St 13, the audio ss 
encoder 700 encodes a specific part of the audio 
stream St5, and thereby generates the audio encoding 
stream St19. 



The video stream buffer 400 is connected to the 
video encoder 300, and stores the video encoding 
stream St15 output from the video encoder 300. The 
video stream buffer 400 is also connected to the encod- 
ing system controller 200, and based on timing signal 
St21 outputs the stored video encoding stream St1 5 as 
time-base adjusted video encoding signal St27. 

In the same manner, the subpicture stream buffer 
600 is connected to the subpicture encoder 500, and 
stores the subpicture encoding stream St1 7 output from 
the subpicture encoder 500. The subpicture stream 
buffer 600 is further connected to the encoding system 
controller 200, and based on the input timing signal 
St23 outputs the stored subpicture encoding stream 
Stl7 as the time-base adjusted subpicture encoding 
stream St29. 

Furthermore, the audio stream buffer 800 is con- 
nected to the audio encoder 700, and stores the audio 
encoding stream St19 output from the audio encoder 
700. The audio stream buffer 800 is further connected 
to the encoding system controller 200, and based on the 
input timing signal St25 outputs the stores audio encod- 
ing stream St1 9 as the time-base adjusted audio encod- 
ing stream St31. 

The system encoder 900 is connected to the video 
stream buffer 400, subpicture stream buffer 600, and 
audio stream buffer 800, and receives therefrom the 
time-base adjusted video encoding signal St27, time- 
base adjusted subpicture encoding stream St29, and 
time-base adjusted audio encoding stream St31. The 
system encoder 900 is further connected to the encod- 
ing system controller 200, and receives therefrom the 
system encoding stream St 33. 

Based on the system encoded encoding parameter 
data and encoding start/stop timing signal St33, the 
system encoder 900 applies a multiplex encoding proc- 
ess to the time-base adjusted streams St27, St29, and 
St31 to generate a title editing unit (VOB) St35. 

The video zone formatter 1300 is connected to the 
system encoder 900 from which a title editing unit St35 
is input. The video zone formatter 1300 is further con- 
nected to the encoding system controller 200 from 
which the formatting parameter data and formatting 
start/stop timing signal St39 are input for formatting a 
multimedia bitstream MBS. Based on the title editing 
unit St39, the video zone formatter 1300 rearranges the 
title editing units St35 for one video zone VZ into a 
sequence conforming to the user-defined scenario, and 
thereby generates an edited multimedia bitstream St43. 

This multimedia bitstream St43 edited to user- 
defined scenario content is then transferred to a record- 
ing section 1200. The recording section 1200 processes 
the edited multimedia bitstream MBS to data St43 in a 
format corresponding to the recording medium M, and 
records to the recording medium M. Note that the multi- 
media bitstream MBS contains the volume file structure 
VFS, which indicates physical addresses on the record- 
ing medium generated by the video zone formatter 
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1300. 

The encoded multimedia bitstream St35 can also 
be output directly to a decoder as described below for 
reproduction of the edited title content. It will be obvious 
that in this case the multimedia bitstream MBS does not s 
contain a volume file structure VFS. 

< 1.3)Scenario data St7 generation 

The method of generating the scenario data St7 
described above is described next below with reference 
to Fig. 37 in which a configuration of the editing informa- 
tion generator 100 is shown. As shown in Fig. 37, the 
editing information generator 100 comprises an editing 
information input means 102, editing information con- 
verter 104, authoring editor parameter setting means 
110, editing information display 106, and stream input 
buffer 109. The editing objects, that is, the video, sub- 
picture, and audio source streams St1 , St2, and St3 are 
temporarily stored synchronized to the edit timing in the 
stream input buffer 1 09. 

The editing information input means 102 is a means 
for inputting editing instructions reflecting user intent to 
the authoring encoder EC. 

The editing information converter 104 is a means 
for converting the editing instruction signal input by the 
user by means of the editing information input means 
102, and generating the scenario data St7. 

The authoring editor parameter setting means 110 
outputs a synchronization signal St300 for synchroniza- 
tion with the source stream edit timing to the stream 
input buffer 109. 

The authoring editor parameter setting means 110 
also determines whether the encoded user editing com- 
mand content conforms to the configuration and func- 
tional limitations of the multimedia bitstream data format 
of the multimedia bitstream used in the authoring sys- 
tem, the authoring encoder EC, and the authoring 
decoder DC described below, that is, satisfies the 
authoring edit parameter conditions. The authoring edi- 
tor parameter setting means 110 combined with the 
editing information converter 104 are equivalent to the 
CPU described above. It should be noted that the 
authoring editor parameter setting means 110 thus 
comprises the ability to present the source stream to the 
user in the smallest editing units possible. 

When the authoring editor parameter setting means 
1 10 determines that the user's editing control command 
data St7R satisfies the authoring edit parameter condi- 
tions, the authoring editor parameter setting means 110 
outputs the editing control command data St7R to the 
encoding system controller 200 as the scenario data 
St7 shown in Fig. 2. Based on this editing control com- 
mand data, the encoding system controller 200 gener- 
ates the authoring encoding parameters. Based on 
these authoring encoding parameters, the authoring 
encoder EC encodes the source streams St1, St3, and 
St5 to generate an authoring title bitstream. 



As a result of evaluating the authoring edit parame- 
ter conditions contained in the user's editing control 
command data St7R, the authoring editor parameter 
setting means 110 outputs the editing parameter data, 
that is, data indicative of the authoring edit parameters 
and the editing condition values, as editing information 
data St302 usable by the editing information display 
106. 

The editing information display 106 is equivalent to 
the above-described displayed, and by checking the 
editing parameters and editing condition values shown 
on the display, a user can reenter correct editing com- 
mands using the editing information input means 102. 

An authoring encoder EC comprising an editing 
information generator 100 with a different configuration 
from that shown in Fig. 37 is described next below with 
reference to Fig. 2 and Fig. 38. In the editing information 
generator 100 shown in Fig. 38, the authoring editor 
parameter setting means 1 10 of the editing information 
generator 100 shown in Fig. 37 is replaced by an editing 
information converter 111. In addition, the editing infor- 
mation converter 104 outputs the. scenario data St7 
simultaneously to this editing information converter 1 1 1 
and the encoding system controller 200. Based on the 
scenario data St7, the encoding system controller 200 
generates editing control command data St7 in the 
same manner as the authoring editor parameter setting 
means 110 shown in Fig. 37, and determines whether 
the encoded user editing command content satisfies the 
conditions of the authoring edit parameters. The encod- 
ing system controller 200 then embeds data indicative 
of the result of this evaluation in the editing control com- 
mand data St7', which it outputs as shown in Fig. 2 to 
the editing information convenor 1 1 1 in the editing infor- 
mation generator 100. 

The editing information converter 1 1 1 outputs edit- 
ing information data St302 as scenario data St7 simi- 
larly to the authoring editor parameter setting means 
1 10 (Fig. 37). In addition, the editing information conver- 
ter 1 1 1 includes the evaluation result contained in the 
edit control command data SIT input from the encoding 
system controller 200 in the editing information data 
St302 output from the editing information converter 111. 

Next, the operation of the editing information gener- 
ator 100 shown in Fig. 37 is described first with refer- 
ence to the flow chart in Fig. 39. 

At step #201, the user enters desired title editing 
instructions using the editing information generator 100 
for the source streams St1, St3, and St5 displayed on 
the editing information display 106. 

At step #202, the editing information converter 104 
generates the scenario data St7, the authoring editor 
parameter setting means 110 generates the editing 
parameter data St302, and the editing information dis- 
play 106 presents the content of the editing instructions 
for the user. 

At step #203, the authoring editor parameter setting 
means 110 generates editing command data St7R 
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based on the scenario data St7. 

At step #204, it is determined whether editing com- 
mand data St7R satisfies the authoring editor parame- 
ter conditions. If it is determined that the conditions are 
met, the editing control command data St7R is output as s 
the scenario data St7 (Fig. 2) to the encoding system 
controller 200. 

However, if it is determined in step #204 that the 
authoring editor parameter conditions are not met, the 
editing information data St302 is output to the editing 10 
information display 106, and control loops back to step 
#201. The loop of step #201, #202, #203, and #204 is 
thus repeated until the editing control commands input 
by the user satisfy the authoring editor parameter condi- 
tions. 15 

At step #205, the encoding system controller 200 
generates the authoring encoding parameters based on 
this editing control command data. The authoring 
encoder EC then encodes the St1, St3, and St5 based 
on the authoring encoding parameters to generate an 20 
authoring title bitstream. 

The operation of a editing information generator 
100 as shown in Rg. 38 is also basically the same as 
the flow chart shewn in Rg. 39, except that in steps 
#203 and #204 the encoding system controller 200 (Rg. 25 
2), and not the authoring editor parameter setting 
means 110 (Rg. 37), generates edit control command 
data StT based on scenario data St7, and determines 
whether the authoring editor parameter conditions are 
met. 30 

Authoring encoder parameters exemplary of the 
present invention are described briefly next. To simplify 
this description, the authoring encoder parameters are 
assumed below to comprise a group of four parameters: 
SJTIME, which is indicative of the start and end times of 35 
material corresponding to a scene; SRS_KIND, which is 
indicative of the type of material; SN_KIND, which is 
indicative of the type of scene; and SN_PBI, which is 
indicative of the presentation information of the scene. 
The start and end times of material corresponding to a <o 
scene (SJTIME) is information about the start time and 
the end time of an input source for scene encoding; on 
a digital VCR tape, for example, this would be the 
encoding start and stop times of the tape source. The 
material type (SRS_KIND) is information indicating 45 
whether the material is telecine material or not The 
scene type (SN _KiND) indicates whether the scene is a 
multiangle scene, or a parental lock scene. The presen- 
tation order information of a scene (SN_PBI) is informa- 
tion that indicates what title information is used in a so 
scene, and the presentation sequence of that title infor- 
mation. If a scene is used in a plurality of titles, scene 
presentation information SN_PBI is generated for that 
scene for each title in which it is used. It will be obvious 
that more than these four parameters will be generated 55 
depending upon the edited content. 



( 1 .4 ) Authoring decoder DCD 

Referring next to Rg. 3, a preferred embodiment of 
an authoring decoder DC when a multimedia bitstream 
authoring system according to the present invention is 
applied to a DVD system as described above. An 
authoring encoder DCD (hereafter, a "DVD decoder") 
appropriate to a DVD system decodes a multimedia bit- 
stream MBS edited by a DVD encoder ECD according 
to the present invention to present the content of each 
title according to a user-selected scenario. It should be 
noted that in the present embodiment of the invention a 
multimedia bitstream St45 encoded by a DVD encoder 
EC is recorded to a recording medium M. 

An authoring decoder DCD comprises a multimedia 
bitstream reproduction unit 2000, a scenario selector 
2100, a decoding system controller 2300, a stream 
buffer 2400, a system decoder 2500. a video buffer 
2600, a subpicture buffer 2700, an audio buffer 2800, a 
synchronization controller 2900, a subpicture decoder 
3100, an audio decoder 3200, a reordering buffer 3300, 
and a selector 3400, synthesizer 3500, video data out- 
put terminal 3600, an audio data output terminal 3700, 
and a video decoder 3801 . 

Note that the selector 3400 is connected to the syn- 
chronization controller 2900, and receives therefrom 
input of selection command signal St 103. 

The multimedia bitstream reproduction unit 2000 
comprises a recording medium drive unit 2004 for driv- 
ing the recording medium M, a reading head unit 2006 
for reading information recorded to the recording 
medium M and generating a binary read signal St57, a 
signal processing unit 2008 for generating a reproduc- 
tion bitstream St61 by variously processing the read sig- 
nal ST57, and a controller 2002. The controller 2002 is 
connected to the decoding system controller 2300 from 
which it receives a multimedia bitstream reproduction 
command signal St53, and generates reproduction con- 
trol signals St55 and St59 for controlling, respectively, 
the recording medium drive unit (motor) 2004 and signal 
processing unit 2008. 

So that the user-selected parts of the video, subpic- 
ture, and audio components of the multimedia title 
edited by the authoring encoder EC are reproduced, the 
decoder DCD comprises a scenario selector 2100 
whereby scenario data can be output to appropriately 
instruct the authoring decoder DC so that the corre- 
sponding scenario is selected and reproduced. 

The scenario selector 2100 preferably comprises a 
keyboard and CPU. Based on the content of a scenario 
entered using the authoring encoder EC, the user uses 
the keyboard to enter a particular desired scenario. 
Based on the keyboard input, the CPU then generates 
scenario selection data St51 indicating a selected sce- 
nario. The scenario selector 2100 is connected to the 
decoding system controller 2300 by an infrared commu- 
nications device or other means. 

The decoding system controller 2300 generates 
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scenario selection data St51, and based thereon the 
decoding system controller 2300 generates a reproduc- 
tion instruction signal St53 for controlling operation of 
the multimedia bitstream reproduction unit 2000. The 
decoding system controller 2300 further extracts the 5 
user's reproduction instruction information from the sce- 
nario data St53, and generates the decoding informa- 
tion table required for decoding control. The decoding 
information table is described in detail below with refer- 
ence to Fig. 32 and Fig. 33. The decoding system con- 
troller 2300 also extracts title information recorded to 
the recording medium M from file data area FDS infor- 
mation in the stream reproduction data St63, and 
thereby generates title information St200; the extracted 
title information includes video manager VMG, VTS 
information VTSI, PGC information C.PBI #j, and cell 
playback time (C_PBTM). 

( 1.5>Scenario selection data St51 generation 

The scenario selector 2100 is described next in 
detail below with reference to Fig. 40. The scenario 
selector 2100 comprises a scenario selection informa- 
tion entering means 2102, a scenario selection informa- 
tion converter 2104, and a title information display unit 
2106. 

The scenario selection information entering means 
2102 accepts scenario selection data input by a user- 
operable means such as a remote control device, or as 
a computer file or other type of electronic data contain- 
ing scenario selection information. 

The scenario selection information converter 2104 
converts information from the scenario selection infor- 
mation entering means 2105 to coding information St51 
that can be processed by the decoding system control- 
ler 2300. The title information display unit 2106 is a title 
information display unit for presenting the reproduced 
title information St200 (Fig. 3). For example, when a plu- 
rality of titles are contained on a disk, the title informa- 
tion display unit 2106 presents image information 
needed for title selection on a remote control or other 
device. It should be further noted that this title informa- 
tion display unit 2106 is not necessarily part of scenario 
selector 2100, and the image information needed for 
title selection can be presented on a CRT or other dis- 
play apparatus connected to the video data output ter- 
minal 3600. 

Returning to Fig. 3, the stream buffer 2400 has a 
particular buffer capacity, and temporarily stores repro- 
duction signal bitstream St61 input from the multimedia 
bitstream reproduction unit 2000, and extracts the 
stream address information and initial synchronization 
value information to generate stream control data St63. 
The stream buffer 2400 is connected to the decoding 
system controller 2300, and decodes the generated 
stream control data St63 for supply to the decoding sys- 
tem controller 2300. 

The synchronization controller 2900 is connected to 



the decoding system controller 2300, receives there- 
from the initial synchronization value information (SCR), 
sets an internal system clock (STC), and supplies a 
reset system clock St79 to the decoding system control- 
ler 2300. 

The decoding system controller 2300 generates a 
stream read signal St65 at a specific interval based on 
system clock St79, and supplies the stream read signal 
St65 to the stream buffer 2400. 

The data read unit in this case is a packet. The 
decoding system controller 2300 compares the SCR in 
the stream control data extracted from the stream buffer 
2400 with the system clock St79 from the synchroniza- 
tion controller 2900; when the system clock St79 
exceeds the SCR in St63, the decoding system control- 
ler 2300 generates read request signal St65. Note that 
this read control operation is applied in packet units, and 
thus controls packet transfers. 

Based on read signal St65, the stream buffer 2400 
outputs reproduction bitstream St61 at a specific inter- 
val. 

The decoding system controller 2300 also gener- 
ates a decoding stream instruction signal St69 based 
on scenario selection data St51 . and outputs the decod- 
ing stream instruction signal St69 to the system 
decoder 2500. The decoding stream instruction signal 
St69 indicates the stream IDs of the video, subpicture, 
and audio streams corresponding to the selected sce- 
nario. - ■ 

Based on the instructions in the decoding stream 
instruction signal St69, the system decoder 2500 sepa- 
rately outputs the video, subpicture, and audio streams 
input from the stream buffer 2400 to the video buffer 
2600, subpicture buffer 2700, and audio buffer 2800 as 
video encoding stream St71, subpicture encoding 
stream St73, and audio encoding stream St75, respec- 
tively. 

When a title contains a plurality of audio data 
streams for audio in different languages, for example 
Japanese, English, and French dialogues, and a plural- 
ity of subpicture data streams for subtitles, for example, 
in different languages, for example Japanese subtitles, 
English subtitles, and French subtitles, an ID is 
assigned to each stream. That is, as described with ref- 
erence to Fig. 7, a stream ID is assigned to video data 
and MPEG audio data, and a substream ID is assigned 
to subpicture data, AC3 audio data, linear PCM, and 
navigation pack NV data. While the user is not aware of 
the ID, the user selects the language in which to present 
the audio or subtitles using the scenario selector 2100. 
If English audio is selected, the ID corresponding to the 
English audio is transferred to the decoding system 
controller 2300 as scenario selection data St51. The 
decoding system controller 2300 then forwards that ID 
to the system decoder 2500 in St69. 

Based on the instructions in the decoding stream 
instruction signal St69, the system decoder 2500 sepa- 
rately outputs the video, subpicture, and audio streams 
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input from the stream buffer 2400 to the video buffer 
2600, subpicture buffer 2700, and audio buffer 2800 as 
video encoding stream St71, subpicture encoding 
stream St73, and audio encoding stream St75, respec- 
tively. That is, when the stream ID input from the see- s 
nario selector 2100 matches the packet ID transferred 
from the stream buffer 2400, the system decoder 2500 
transfers the packets to the corresponding buffers 
(video buffer 2600, subpicture buffer 2700, audio buffer 
2800). 

In addition, the system decoder 2500 detects the 
reproduction start time (PTS) and decoding start time 
(DTS) for the smallest control unit in each stream St67, 
and generates timing information signal St77. This tim- 
ing information signaJ St77 is passed through the 
decoding system controller 2300, and supplied to the 
synchronization controller 2900 as synchronization con- 
trol data StB1. 

Using the synchronization control data St81, the 
synchronization controller 2900 determines the decod- 
ing start timing for each stream so that after decoding 
the streams are in a particular sequence. Based on this 
decoding timing, the synchronization controller 2900 
generates and supplies a video stream decoding start 
signal St89 to the video decoder 3800. The synchroni- 
zation controller 2900 similarly generates and supplies 
a subpicture stream decoding start signal St91 and 
audio stream decoding start signal St93 to the subpic- 
ture decoder 3100 and audio decoder 3200, respec- 
tively. 

The video decoder 3801 produces and outputs a 
video output request signal St84 to the video buffer 
2600 based on the video stream decoding start signal 
St89. In response to the video output request signal 
St84, the video buffer 2600 outputs the video stream 
St83 to the video decoder 3801. The video decoder 
3801 detects the reproduction time information con- 
tained in the video stream St83, and nullifies the video 
output request signal St84 when the received bit length 
of the video stream St83 is equivalent to the reproduc- 
tion time. As a result, a video stream length correspond- 
ing to a particular reproduction time is decoded, by the 
video decoder 3801, and the reproduced video signal 
St95 is output to the reordering buffer 3300 and selector 
3400. 

The video encoding stream is encoded using inter- 
frame correlation, and when viewed in frame units, the 
presentation sequence and encoding stream sequence 
do not match. Presentation in the decoding sequence is 
therefore not possible. As a result, decoded frames are 
temporarily stored to the reordering buffer 3300. St103 
is controlled by the synchronization controller 2900 to 
obtain the presentation sequence, and switches the out- 
put to the synthesizer 3500 between output St95 of 
video decoder 3801 and the output from the reordering 
buffer St97. 

The subpicture decoder 3100 similarly generates 
and supplies subpicture output request signal St86 to 



the subpicture buffer 2700 based on the subpicture 
decoding start signal St91. The subpicture buffer 2700 
outputs the subpicture stream St85 to the subpicture 
decoder 3100 in response to the video output request 
signal St84. The subpicture decoder 3100 decodes a 
length of the subpicture stream St85 corresponding to a 
specific reproduction time based on the reproduction 
time information contained in the subpicture stream 
St85, and thus reproduces and outputs a subpicture sig- 
nal St99 to the synthesizer 3500. 

The synthesizer 3500 superimposes the output of 
selector 3400 and subpicture signal St99 to generate 
and output video signal St105 to the video output termi- 
nal 3600. 

The audio decoder 3200 generates and outputs 
audio output request signal St88 to the audio buffer 
2800 based on the audio decoding start signal St93. 
The audio buffer 2800 outputs audio stream St87 to the 
audio decoder 3200 in response to the audio output 
request signal St88. The audio decoder 3200 decodes a 
length of the audio stream St87 corresponding to a spe- 
cific reproduction time based on the reproduction time 
information contained in the audio stream St87, and 
outputs to the audio output terminal 3700. 

It is thus possible to reproduce a user-selected mul- 
timedia bitstream MBS in real-time in response to a 
user's scenario selection. That is, each time a user 
selects a different scenario, the title content desired by 
the user can be reproduced by means of the authoring 
decoder DCD reproducing a multimedia bitstream MBS 
corresponding to the selected scenario. 

It should be noted that the decoding system control- 
ler 2300 can supply title information signal St200 to the 
scenario selector 2100 by such means as the above- 
noted infrared communications device. The scenario 
selector 2100 can extract and present on an internal 
display the title information recorded to an optical disc M 
from the file data area FDS information in the stream 
reproduction data St63 contained in the title information 
signal St200, and thereby enable interactive scenario 
selection by the user. 

It should be further noted that the stream buffer 
2400, video buffer 2600, subpicture buffer 2700, audio 
buffer 2800, and reordering buffer 3300 are functionally 
different and are therefore represented above as sepa- 
rate buffers. However, a single memory buffer can be 
made functionally equivalent to these separate buffers 
by using on a time-share basis a memory buffer that 
operates at several times the read/write rate required by 
these separate buffers. 

As described above, an authoring system accord- 
ing to the present invention can generate a multimedia 
bitstream conforming to a plurality of selectable scenar- 
ios by real-time or batch encoding multimedia source 
data such that a plurality of branchable substreams of 
the smallest content editing units of the basic title con- 
tent are arranged in a specific time-base relationship. 

In addition, a multimedia bitstream encoded in this 
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manner can be reproduced according to a desired sce- 
nario selected from among the plural scenarios. During 
reproduction, it is also possible to select (switch to) a 
scenario different from the selected scenario, and 
(dynamically) reproduce a multimedia bitstream corre- 5 
sponding to the newly selected scenario. It is also pos- 
sible to dynamically select and reproduce a particular 
scene from among a plurality of scenes during repro- 
duction of title content following a selected scenario. 

As thus described, an authoring system according 10 
to the present invention can encode and not only repro- 
duce but repeatedly reproduce a multimedia bitstream 
MBS in real-time. An authoring system according to the 
present invention described above is described below 
with application to a DVD system, and particularly to is 
seamless connection editing. 

< 2.1 >Parerttal lock control and murtiangle control 

Parental lock control and murtiangle control in the 20 
present invention are described below. The presenta- 
tion time of video data from which images have been cut 
according to the viewer level setting for parental lock 
control is shorter than that of the other video data. In 
addition, the length of video data that is alternatively 25 
reproduced according to parental lock control varies. 

In multiangle control, the reproduction time of each 
video data option within a particular angle view is the 
same because the audio and video parts of each angle 
view are reproduced parallel to the same time base. It is 30 
important to note here that "reproduction time" is used 
in a broad sense, and refers to the time during which 
data is read from disc, converted to audio and video sig- 
nals, and audio and video are output. It is not to be nar- 
rowly interpreted and understood to mean only the time 35 
for reading from disc, or only the time required to finish 
decoding the read data. Furthermore, even though the 
reproduction time within a particular angle view is the 
same, the video content differs. As a result, the data 
quantity compressed by the MPEG-2 method is varia- 40 
ble, and the data quantity in each converted angle view 
will, of course, also vary. 

One technology that is a background to parental 
lock control and multiangle control is alternative repro- 
duction of common video data. A problem in implement- 45 
ing alternative reproduction is sustaining video data 
reproduction without interruption between the selected 
video data segments. However, when video data is 
arranged in a continuous recording area on an optical 
disc, only one video data segment can be placed adja- so 
cent to the preceding and following video data seg- 
ments. A data seek operation is therefore required to 
reproduce non-adjacent video data, and continuous 
video reproduction is interrupted. 

The delay imposed by a seek operation of a certain 55 
duration can be absorbed by causing the disc reproduc- 
tion apparatus to buffer reproduction data during repro- 
duction. However, when the alternative reproduction 



period is long, the buffer capacity required by the disc 
reproduction apparatus increases, and may be 
exceeded. A buffer overflow occurs when this happens. 
To reduce the delay time of any seek operation, a tech- 
nology known as "interleaved recording" is used. 

In interleaved recording the video data is seg- 
mented into interleave units of a particular size, and 
alternative reproduction periods are formed by arrang- 
ing these interleave units in an alternating pattern in the 
continuous recording area of the optical disc. The seek 
distance of any seek operation can thus be suppressed 
to the size of the interleave unit, and reproduction can 
be sustained without causing a buffer overflow even 
when the alternative reproduction period continues for 
an extended period of time. A reproduction method that 
can be used for parental lock control and multiangle 
control without a buffer overflow occurring is described 
below. 

< 2.2 >Segmenting an alternative reproduction period for 
parental lock control 

When forming an alternative reproduction period for 
parental lock control, it is necessary to form the seg- 
ments both so that a buffer overflow as described above 
does not occur, and so that a buffer underflow, which 
occurs when the video data that should be reproduced 
is not in the buffer, does not occur in the disc reproduc- 
tion apparatus. The performance of the disc reproduc- 
tion apparatus, including the buffer size and the seek 
speed, must be considered to form a alternative repro- 
duction period so that a buffer overflow does not occur. 

That is, if JMP is the maximum jump time of the disc 
reproduction apparatus that will not result in video inter- 
ruption, the minimum Mmin number of segments M of 
the video data divided into interleave units to form an 
alternative reproduction period is defined by the follow- 
ing equation 

Mmin * VOBMaxTime/JMT (1) 
where JMT is derived from the buffer size and seek per- 
formance, and VOBMaxTlme is the longest video data 
segment associated with the alternative reproduction 
period. With equation (1), the number of interleave units 
into which video data forming an alternative reproduc- 
tion period is segmented must be a value greater than 
or equal to M, in which case a problem with the buffer 
overflowing occurs during interleave unit reproduction, 
and if there is a jump, a buffer underflow occurs 
because there is not sufficient data stored in the buffer. 

In addition, to prevent a buffer underflow from 
occurring, the jump must be completed by the time 
decoding the video data in one interleave unit present in 
the buffer before the jump is completed, and the video 
data of the interleave unit decoded next after the jump 
must be completely reproduced in the buffer. 

That is, if JT is the time needed for a jump (for 
example, 0.4 sec), BitRate is the data input transfer rate 
to the track buffer of the reproduction apparatus (for 
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example, a constant 10 Mbps), and ILUM is the data 
size of an interleave unit, the reproduction time (ILUMT) 
assured by video data in the buffer must be greater than 
longest time needed to prepare for decoding the next 
video data when a jump occurs, and must satisfy the fol* 5 
lowing equation. 

ILUMT > JT + (ILUM / BitRate) (2) 
The number of segments M of the alternative repro- 
duction period can thus be obtained by the following 
equation. 10 

M = VOBTime / ILUMT (3) 
where VOBTime is the reproduction time of each video 
data block in the alternative reproduction period. 

If the minimum value of video data in the alternative 
reproduction period is VOBminTime, the maximum 15 
Mmax number of interleave unit segments M is 
expressed by the following equation. 

Mmax s VOBminTime / (JT + 
(ILUM/BrtRate)) (4) 

That is, if the maximum Mmax number of interleave unit 20 
segments M is less than maximum segments Mmax, a 
buffer underflow problem will occur in the disc reproduc- 
tion apparatus. 

Thus, as described above, the video data must be 
segmented into M values (Mmin, Mmax) satisfying 25 
equations (1), and (4) when forming an alternative 
reproduction period. 

( 2.3 >An example of limit values of an alternative repro- 
duction period for parental lock control 30 

A specific example of control values used in an 
alternative reproduction period for parental lock control 
are shown below. 

Using the values: 35 

VOB maximum bit rate: 8 Mbps 
VOBminTime: 4.0 sec 

VOBTime: 1 20 sec min 

40 

ILUMT > 0.4 + 8*2/10 = 2.0 (sec), based on equa- 
tion 2, 

JMT = 250 MBIT/8 = 31.25 (sec), based on equa- 
tion 1, 

Mmax <, VOBminTime/ILUMT = 4.0/2.0 = 1, from 45 
equation 4, 

M = VOBTime/JMT = 120/31.25 = 3.84, from equa- 
tion 3. 

In this case, the values obtained from equations 3 and 4 50 
are mutually incompatible, and an error results. 

< 2.4 dividing an alternative reproduction period for 
angles 

55 

When forming alternative reproduction periods for 
multi-angle control, the number of segments in the 
period must be determined with consideration given to 



the buffer overflow problem described above, while at 
the same time assuring that the video data is structured 
with an even distribution of video data. This is because, 
unlike with parental lock control, it must be possible to 
dynamically change the video in response to user 
instructions during reproduction. If the length of the 
video data varies, it is not possible to synchronize with 
the video after the angle is changed, and aurfo presen- 
tation may be interrupted. 

For the video data structure to be the same before 
and after an angle change, the GOP structures consti- 
tuting the image data in the video data must be equal. 
Note that "equal GOP structures" as used here means 
that the number of GOP and the number of pictures in 
each GOP contained in the corresponding interleave 
units are the same. That is, encoding the elementary 
streams containing the video information from which the 
video data constituting an angle view is generated must 
be controlled to satisfy these limitations during elemen- 
tary stream encoding. 

A typical example of when this problem occurs is 
when the source material is film (24 frames/sec) that 
has been telecine converted. Telecine conversion is a 
process whereby a film source, which is recorded at 24 
frames per second, is converted to a conventional tele- 
vision signal requiring field interpolation to achieve the 
required 30 frames per second. When telecine con- 
verted material is then converted to MPEG data, the 
fields inserted by interpolation are redundant. There- 
fore, to improve the compression rate, reverse telecine 
conversion is applied to restore the original 24 fps rate 
before compression as MPEG data. 

However, while this reverse telecine conversion 
reduces the overall number of frames from 30 to 24, the 
video data may not be uniformly converted at this rate, 
and may be locally converted at a variable rate between 
24 to 30 frames per second. Applying this reverse tele- 
cine conversion process to video data that is part of an 
angle view may result in a picture count that is not con- 
stant and the GOP structure of each VOB may then dif- 
fer. 

The picture count which is one condition of equal 
GOP structures, is described in detail below. 

The picture count as used herein refers more pre- 
cisely to the number of reproduced pictures contained in 
a GOP. Furthermore, the encoded frames of a GOP and 
the number of reproduced pictures do not necessarily 
match. When a source signal is encoded from 30 fps to 
24 fps during reverse telecine conversion, the number 
of pictures clearly decreases. When this happens, infor- 
mation indicating which field(s) were removed, and 
when the fields should be reproduced, is written to the 
encoded stream as the RFF flag and TFF flag. (RFF: 
Repeat First Field; TFF: Top Field First). As a result if 
the picture count for which said flag (RFF) is on differs, 
the reproduction time will differ even in GOPs having the 
same number of encoded frames. 

In addition, the number of audio data channels and 
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the number of subpicture data channels contained in 
the video data must also be equal. This is because if the 
number of channels differs, a channel selected before 
the video data changed may not exist after the video 
data is changed. That is, it is necessary during elemen- 5 
tary stream encoding to encode only those parts of the 
audio data or subpicture data providing the same 
number of channels. 

The data structure of a DVD system related to the 
present invention is described further below with refer- 10 
ence to Rg. 5, Fig. 6, Fig. 7, and Rg. 8. 

<2.5>Multiseene control 

If the requirements of parental lock control playback 15 
and multi-angle control playback described above are 
satisfied by individually recording a separate title with 
the content required to present every possible scenario, 
the same number of titles as possible scenarios must be 
prepared and recorded to the recording medium with 20 
each title containing substantially the same scene data 
with only slight differences therebetween. This means 
that the same data will be repeatedly recorded to a large 
part of the recording medium, significantly impairing the 
utilization efficiency of the available storage capacity of 25 
the recording medium. Furthermore, even if the storage 
capacity of the recording medium is very large, as it is 
on DVD media, it will be impossible to record enough 
titles for every possible request. While this problem can 
be considered solvable by increasing the capacity of the 30 
recording medium, this is not particularly desirable 
when effective use of available system resources is con- 
sidered. 

Using multiscene control, which is described in out- 
line below, in a DVD system, titles that can be repro- 35 
duced according to many different scenarios can be 
assembled using a minimal amount of data, thereby 
enabling effective use of recording media and other sys- 
tem resources. 

More specifically, titles that can be reproduced 40 
according to many different scenarios are assembled 
from basic scene periods comprising data common to a 
plurality of titles, and multiscene periods comprising the 
groups of scenes that vary according to the selected 
scenario. During reproduction, the user can freely and 45 
at any time select a specific scene in any of the multi- 
scene periods. It should be noted that multiscene con- 
trol in the cases of parental lock control playback and 
multi-angle control playback is described below with ref- 
erence to Rg. 9. so 

{ 3.1 )Data structure in a DVD system 

The internal structure of a video title set VTS as 
shown in Fig. 1 is shown in Fig. 5. A video title set VTS ss 
comprises VTS information (VTSI) representing the 
management information for the entire disc, and VOBS 
for VTS titles (VTSTT_VOBS), which is the system 



stream of the multimedia bitstream. The VTS informa- 
tion is described first below, and the VOBS for a VTS 
title is then described. 

VTS information comprises, primarily, a VTSI man- 
agement table (VTSLMAT), and VTSPGC information 
table (VTS_PGCIT). 

the VTSI management table records the internal 
format of the video title set VTS, the number of selecta- 
ble audio streams contained in the video title set VTS, 
the number of subpictures, and the location where the 
video title set VTS is stored. 

The VTSPGC information management table is a 
table recording / (where / is a natural number) PGC data 
VTS_PGCI#1 to VTS_PGCI#I indicative of the program 
chain (PGC) controlling the reproduction sequence. 
Each PGC information entry VTS_PGCI#I is informa- 
tion indicative of a program chain, and comprises j 
(where j is a natural number) cell reproduction data 
C_PBI#1 to C_PBI#j. Each cell reproduction data 
C_PBI#j contains control information related to the 
reproduction and reproduction sequence of a cell. 

Conceptually, a program chain PGC describes the 
story of a title, which is formed by recording the repro- 
duction sequence of the ceils (described below). When 
the VTS information is information related to a menu, for 
example, the VTS information is stored to a buffer in the 
reproduction apparatus when reproduction starts, and 
is referenced by the reproduction apparatus when the 
"menu" button on a remote control device is pressed 
during playback, thus causing the top menu selection 
#1, for example, to be presented. If a hierarchical menu 
structure is used, then program chain information 
vTS_PGCI#1 .for example, may be the main menu dis- 
played when the "menu" button is pressed, #2 to #9 may 
be submenus corresponding to numbers on the remote 
control keypad, and #10 and up may be submenus 
another layer down on the hierarchy. In another possible 
configuration, #1 may be a the top menu selection dis- 
played when the "menu" button is pressed, and #2 and 
up may be voice instructions reproduced when the cor- 
responding button is pressed on the numeric keypad. 

The menus are also expressed by a plurality of pro- 
gram chains specified in this table, and can be config- 
ured in any desired manner, whether they are 
hierarchical menus, menus containing voice instruc- 
tions, or menu design. 

In addition, if a movie is represented by the program 
chain, it is stored in the buffer of the reproduction appa- 
ratus when reproduction starts, and the reproduction 
apparatus references the reproduction sequence writ- 
ten to the PGC to reproduce the system stream, 

A "cell" as used herein is all or part of the system 
stream, and used as the access point for reproduction. 
For example, if a movie is recorded, a cell can be used 
as the chapters into which a title is segmented. 

ft should be noted that each of the entered PGC 
information C_PBI#j contains cell reproduction process- 
ing information and a cell information table. Reproduc- 
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tion processing information comprises the reproduction 
time, number of times a cell is repeated, and other 
processing information required for ceil reproduction. 
More specifically, the reproduction processing informa- 
tion includes the block mode (CBM), eel) block type 
(CBT), seamless playback flag (SPF), interleaved allo- 
cation flag (IAF), STC discontinuity flag (STCDF), cell 
presentation time (C_PBTM), seamless angle change 
flag (SACF), the first cell VOBU start address 
(C_FVOBUSA) ( and the last cell VOBU start address 
(C_LVOBU_SA). 

"Seamless reproduction" as used herein is the abil- 
ity in a DVD system to reproduce a multimedia source 
containing, for example, video, audio, and ancillary 
video information without interruption of the data or 
information. It is described in detail with reference to 
Fig. 10 and Fig. 11. 

The block mode CBM is indicative of a plurality of 
cells constitute one functional block; the cell reproduc- 
tion information of each cell in a functional block is con- 
secutively arranged in the PGC information. The CBM 
of the cell reproduction information placed at the begin- 
ning is a value indicating the "first ceil in the block"; the 
CBM of the cell reproduction information placed at the 
end is a value indicating the last cell in the block"; and 
the CBM of the reproduction information arranged 
between the first and last cells is a value indicating a 
"cell inside the block." 

The cell block type CBT is indicative of the type of 
block indicated by the block mode CBM. For example, 
when a multi-angle control function is enabled, cell infor- 
mation for reproducing each angle is defined as the 
functional block described above, and the block type 
CBT in the cell reproduction information of each cell in 
that block is then set to a value indicating an "angle." 

The seamless playback flag (SPF) indicates 
whether the corresponding cell is to be linked and 
played back seamlessly with the ceil or cell block repro- 
duced immediately therebefore. To seamlessly repro- 
duce a given cell with the preceding cell or cell block, 
the seamless playback flag SPF is set to 1 in the cell 
playback information for that cell; otherwise SPF is set 
toO. 

The interleaved allocation flag IAF stores a value 
identifying whether the cell is in an interleaved block If 
the cell is part of an interleaved block the interleaved 
allocation flag IAF is set to 1 ; otherwise it is set to 0. 

The STC discontinuity flag STCDCF identifies 
whether the system time clock STC used for synchroni- 
zation must be reset when the cell is played back; when 
resetting the system time clock STC is necessary, the 
flag is set to 1 ; otherwise it is set to 0. 

Tne seamless angle change flag SACF stores a 
value indicating whether a cell in a multi-angle period 
should be connected seamlessly at an angle change. If 
the angle change is seamless, the seamless angle 
change flag SACF is set to 1 ; otherwise it is set to 0. 

The cell presentation time (C_PBTM) expresses 



the cell presentation time with video frame precision. 

The last cell VOBU start address C_LVOBU_SA is 
the VOBU start address of the last cell in the block. The 
value of this address is expressed as the distance from 

s the logic sector of the first cell in the VTS title VOBS 
(VTSTT_VOBS) as measured by the number of sectors. 
The first cell VOBU start address C_FVOBU_SA is the 
VOBU start address of the first cell in a block, and is 
expressed as the distance from the logic sector of the 

w first cell in the VTS title VOBS (VTSTT_VOBS) as 
measured by the number of sectors. 

The VTS title VOBS, that is, one multimedia system 
stream data VTSTTVOBS, is described next. The sys- 
tem stream data VTSTT_VOBS comprises / (where / is 

15 a natural number) system streams SS, each of which is 
referred to as a Video object" VOB. Each video object 
VOB #1 to VOB #i comprises at least one video data 
block interleaved with up to a maximum eight audio data 
blocks and up to a maximum 32 subpicture data blocks. 

20 Each video object VOB comprises q (where q is a 
natural number) cells C #1 to C #q. Each cell C com- 
prises r (where r is a natural number) video object units 
VOBU #1 to VOBU #r. 

Each video object unit VOBU comprises plural 

25 GOP, and an equivalent time-length of audio and sub- 
pictures. Note that the GOP corresponds to the video 
encoding refresh cycle. Each video object unit VOBU 
also starts with an NV pack, that is, the control data for 
that VOBU. The structure of the NV packs is described 

30 with reference to Fig. 7. 

The system stream St35 encoded by the encoder 
EC described with reference to Fig. 12, that is, the inter- 
nal structure of the video zone VZ, is described with ref- 
erence to Fig. 6. Note that the video encoding stream 

35 St1 5 shown in the figure is the compressed one-dimen- 
sionaJ video data stream encoded by the video encoder 
300. The audio encoding stream St19 is likewise the 
compressed one-dimensional audio data stream multi- 
plexing the right and left stereo audio channels encoded 

40 by the audio encoder 700. Note that the audio signal 
shall not be limited to a stereo signal, and can also be a 
multichannel surround-sound signal. Note, further, that 
the audio encoding stream St 19 is not input as a single 
stream, but as three audio source streams St19A, 

45 St19B, and St19C. Thus, a total six compressed data 
streams are interleaved to a single system stream St35. 

The subpicture encoding stream St17, that is, the 
ancillary video data stream, is also input as two source 
streams St17A and St17B. Thus, a total six compressed 

so data streams are interleaved to a single system stream 
St35. 

The system stream St35 is a one-dimensional array 
of packs with a byte size corresponding to the logic sec- 
tors LS #n having a 2048-byte capacity. A stream con- 
55 trol pack is placed at the beginning of the system stream 
SOS, that is, at the beginning of the VOBU. This stream 
control pack is called the "navigation pack NV", and 
records the data format in the system stream and other 
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control information. 

The video encoding stream St15, the audio encod- 
ing streams St19A, St19B, and SU9C, and the subpic- 
ture encoding stream St17, and St17A and St17B, are 
packetized in byte units corresponding to the system s 
stream packs. These packets are shown in the figure as 

packets V1, V2, V3, V4... and Aa1, Aa2 Ab1, Ab2, 

.... Ac1, Ac2 Spa1, Spa2 and Spb1 t Spb2,.... As 

shown in Fig. 1003, these packets are interleaved in the 
appropriate sequence as system stream St35, thus 10 
forming a packet stream, with consideration given to the 
decoder buffer size and the time required by the 
decoder to expand the video and audio data packets. In 
the example shown in the figure, the packet stream is 
interleaved in the sequence V1, V2, Aa1.Ab1.Ac1, 15 
Spa1,Spb1. 

As a result of significantly increased recording/play- 
back capacity, high speed recording/playback, and per- 
formance improvements in the signal processing LSI in 
a DVD system, plural audio data streams and plural 20 
subpicture data streams (graphics data) can be 
recorded interleaved with a single video data stream as 
an MPEG system stream, thereby enabling the user to 
select the specific audio data and subpicture data to be 
reproduced during playback. 25 

The video data is encoded according to the MPEG 
specification with the GOP being the unit of compres- 
sion. In general, each GOP contains 15 frames in the 
case of an NTSC signal, but the specific number of 
frames compressed to one GOP is variable. The stream 30 
management pack, which describes the management 
data containing, for example, the relationship between 
interleaved data, is also interleaved at the GOP unit 
interval. Because the GOP unit is based on the video 
data, changing the number of video frames per GOP 35 
unit changes the interval of the stream management 
packs. This interval is expressed in terms of the presen- 
tation time on the digital video disk within a range from 
0.4 sec. to 1.0 sec. referenced to the GOP unit. If the 
presentation time of contiguous plural GOP units is less 40 
than 1 sec., the management data packs for the video 
data of the plural GOP units is interleaved to a single 
stream. 

These management data packs are referred to as 
navigation packs NV in the DVD system. The data from 45 
one NV pack to the packet immediately preceding the 
next NV pack forms one video object unit VOBU. In gen- 
eral, one contiguous playback unit that can be defined 
as one scene is called a video object VOB, and each 
video object VOB contains plural video object units so 
VOBU. Data sets of plural video objects VOB form a 
VOB set (VOBS). Note that these data units were first 
used in DVD. 

When a plurality of these data streams is inter- 
leaved, the navigation packs NV defining the relation- 55 
ship between the interleaved packs must also be 
interleaved at a defined unit known as the pack number 
unit. Each GOP is normally a unit containing approxi- 



mately 0.5 sec. of video data, which is equivalent to the 
presentation time required for 12 to 15 frames, and one 
stream management packet is generally interleaved 
with the number of data packets required for this pres- 
entation time. 

The stream management information contained in 
the interleaved video, audio, and subpicture data pack- 
ets constituting the system stream is described below 
with reference to Fig. 7. As shown in Fig. 7, the data 
contained in the system stream is recorded in a format 
packed or packetized according to the MPEG-2 stand- 
ard. The packet structure is essentially the same for 
video, audio, and subpicture data. One pack in the DVD 
system has a 2048 byte capacity as described above, 
and contains a pack header PKH and one packet PES; 
each packet PES contains a packet header PTH and 
data block. 

The pack header PKH records the time at which 
that pack is to be sent from stream buffer 2400 to sys- 
tem decoder 2500 (see Fig. 3), that is, the system clock 
reference SCR defining the reference time for synchro- 
nized audio-visual data playback. The MPEG standard 
assumes that the system clock reference SCR is the ref- 
erence clock for the entire decoder operation. With such 
disk media as the digital video disk, however, time man- 
agement specific to individual disk players can be used, 
and a reference clock for the decoder system is there- 
fore separately provided. 

The packet header PTH similarly contains a pres- 
entation time stamp PTS and a decoding time stamp 
DTS, both of which are placed in the packet before the 
access unit (the decoding unit). The presentation time 
stamp PTS defines the time at which the video data or 
audio data contained in the packet should be output as 
the playback output after being decoded, and the 
decoding time stamp DTS defines the time at which the 
video stream should be decoded. Note that the presen- 
tation time stamp PTS effectively defines the display 
start timing of the access unit, and the decoding time 
stamp DTS effectively defines the decoding start timing 
of the access unit. If the PTS and DTS are the same 
time, the DTS is omitted. 

The packet header PTH also contains an 8-bit field 
called the stream ID identifying the packet type, that is, 
whether the packet is a video packet containing a video 
data stream, a private packet, or an MPEG audio 
packet. 

Private packets under the MPEG-2 standard are 
data packets of which the content can be freely defined. 
Private packet 1 in this embodiment of the invention is 
used to carry audio data other than the MPEG audio 
data, and subpicture data; private packet 2 carries the 
PCI packet and DSi packet. 

Private packets 1 and 2 each comprise a packet 
header, private data area, and data area. The private 
data area contains an 8-bit sub-stream ID indicating 
whether the recorded data is audio data or subpicture 
data. The audio data defined by private packet 2 may be 
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defined as any of eight types #0 to #7 of linear PCM or 
AC-3 encoded data. Subpicture data may be defined as 
one of up to 32 types #0 to #31. The data area is the 
field to which data compressed according to the MPEG- 
2 specification is written if the stored data is video data; 
linear PCM, AC-3, or MPEG encoded data is written if 
audio data is stored; or graphics data compressed by 
run length coding is written if subpicture data is stored. 

MPEG-2-compressed video data may be com- 
pressed by constant bit rate (CBR) or variable bit rate 
(VBR) coding. With constant bit rate coding, the video 
stream is input continuously to the video buffer at a con- 
stant rate. This contrasts with variable bit rate coding in 
which the video stream is input intermittently to the 
video buffer, thereby making it possible to suppress the 
generation of unnecessary code. Both constant bit rate 
and variable bit rate coding can be used in the DVD sys- 
tem. 

Because MP EG video data is compressed with var- 
iable length coding, the data quantity in each GOP is not 
constant The video and audio decoding times also dif- 
fer, and the time-base relationship between the video 
and audio data read from an optical disk, and the time- 
base relationship between the video and audio data out- 
put from the decoder, do not match. The method of 
time-base synchronizing the video and audio data is 
therefore described in detail below, but is described 
briefly below based on constant bit rate coding. 

The navigation pack NV structure is shown in Fig. 
8. Each navigation pack NV starts with a pack header 
PKH, and contains a PCI packet and DSI packet. 

As described above, the pack header PKH records 
the time at which that pack is to be sent from stream 
buffer 2400 to system decoder 2500 (see Fig. 3), that is, 
the system clock reference SCR defining the reference 
time for synchronized audio-visual data playback. 

Each PCI packet contains PCI Information (PCLGI) 
and Angle Information for Non-seamless playback 
(NMSL_AGLI). 

The PCI Information (PCI_GI) declares the display 
time of the first video frame (VOBU_S_PTM), and the 
display time of the last video frame (VOBU_E_PTM), in 
the corresponding video object unit VOBU with system 
clock precision (90 kHz). 

The Angle Information for Non-seamless multi- 
angle playback (NMSL_AGLI) states the read start 
address of the corresponding video object unit VOBU 
when the angle is changed, expressed as the number of 
sectors from the beginning of the video object VOB. 
Because there are nine or fewer angles in this example, 
there are nine angle address declaration ceils 
(NMSL_AGL_C1_DSTA to NMSL_AGL_C9_DSTA). 

Each DSI packet contains DSI Information 
(DSI_GI), Seamless Playback Information (SML_PBI), 
and Angle Information for Seamless multi-angle play- 
back (SML_AGU). 

The DSI Information (DSI_GI) declares the address 
of the last pack in the video object unit VOBU 



(VOBU_EA) expressed as the number of sectors from 
the beginning of the video object unit VOBU. 

While seamless playback is described in detail 
later, it should be noted that the continuously read data 

5 units must be interleaved (multiplexed) at the system 
stream level as an interleaved unit ILVU in order to 
seamlessly reproduce split or combined titles. Plural 
system streams interleaved with the interleaved unit 
ILVU as the smallest unit are defined as an interleaved 

w block. 

The Seamless Playback Information (SML_PBI) is 
declared to seamlessly reproduce the stream inter- 
leaved with the interleaved unit ILVU as the smallest 
data unit The Seamless Playback Information 

15 (SML_PBI) contains an Interleaved Unit Rag (ILVU flag) 
identifying whether the corresponding video object unit 
VOBU is an interleaved block. The ILVU flag indicates 
whether the video object unit VOBU is in an interleaved 
block, and is set to 1 when it is. Otherwise the ILVU flag 

20 is set to 0. 

When a video object unit VOBU is in an interleaved 
block, a Unit END flag is declared to indicate whether 
the video object unit VOBU is the last VOBU in the inter- 
leaved unit ILVU. Because the interleaved unit ILVU is 

25 the data unit for continuous reading, the Unit END flag 
is set to 1 if the VOBU currently being read is the last 
VOBU in the interleaved unit ILVU. Otherwise the Unit 
END flag is set to 0. 

When a video object unit VOBU is in an interleaved 

30 block, an Interleaved Unit End Address (ILVU_EA) iden- 
tifying the address of the last pack in the ILVU to which 
the VOBU belongs is written. This address is expressed 
as the number of sectors from the navigation pack NV of 
that VOBU. In addition when a video object unit VOBU 

35 is in an interleaved block, the starting address of the 
next interleaved unit ILVU (NT_ILVU_SA) is also 
declared. This address is expressed as the number of 
sectors from the navigation pack NV of that VOBU. 
When two system streams are seamlessly con- 

40 nected but the audio components of the two system 
streams are not contiguous (the audio is different), par- 
ticularly immediately before and after the seam, it is 
necessary to pause the audio output to synchronize the 
audio and video components of the system stream fol- 

45 lowing the seam. With an NTSC signal, for example, the 
video frame cycle is approximately 33.33 msec while 
the AC-3 audio frame cycle is 32 msec. 

To enable this resynchronization, audio reproduc- 
tion stopping times 1 and 2, that is, Audio Stop PTM 1 in 

50 VOB (VOB_A_STP_PTM1), and Audio Stop PTM2 in 
VOB (VOB_A_STP_PTM2), indicating the time at which 
the audio is to be paused; and audio reproduction stop- 
ping periods 1 and 2. that is, Audio Gap Length 1 in 
VOB (VOB_A_GAP_LEN1) and Audio Gap Length 2 in 

55 VOB (VOB_A_GAP_LEN2), indicating for how long the 
audio is to be paused, are also declared. Note that 
these times are specified at the system clock precision 
(90 kHz). 
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The Angle Information for Seamless multi-angle 
playback (SML_AGLI) declares the read start address 
when the angle is changed. Note that this field is valid 
when seamless, multi-angle control is enabled. This 
address is also expressed as the number of sectors 
from the navigation pack NV of that VOBU. Because 
there are nine or fewer angles, there are nine angle 
address declaration cells (SML_AGL_C1_DSTA to 
SM L_AGL_C9_DSTA) . 

<3.2) pvd enc oder 

A preferred embodiment of an authoring encoder 
ECD in which the multimedia bitstream authoring sys- 
tem according to the present invention is applied to a 
DVD system as described above is described below 
and shown in Fig. 12. It will be obvious that the author- 
ing encoder ECD applied to the DVD system, referred to 
below as a DVD encoder, is substantially identical to the 
authoring encoder EC shown in Fig. 2. 

The basic difference between these encoders is the 
replacement in the DVD encoder ECD of the video zone 
formatter 1300 of the authoring encoder EC above with 
a VOB buffer 1000 and formatter 1100. it will also be 
obvious that the bitstream encoded by this DVD 
encoder ECD is recorded to a DVD medium M. The 
operation of this DVD encoder ECD is therefore 
described below in comparison with the authoring 
encoder EC described above. It should be noted that in 
the authoring encoder ECD shown in Rg. 12, the edit 
control command data St7' from the encoding system 
controller 200 is not fed back to the editing information 
generator 100, but this is not a real problem, and further 
description thereof is thus omitted below. 

As in the above authoring encoder EC, the encod- 
ing system controller 200 generates control signals St9, 
St11, St13, St21, St23, St25, St33, and St39 based on 
the scenario data St7 describing the user-defined edit- 
ing instructions input from the scenario editor 100, and 
controls the video encoder 300, subpicture encoder 
500, and audio encoder 700 in the DVD encoder ECD. 
Note that, similarly to the editing instruction content of 
the authoring system described above, the editing 
instruction content in the DVD system contains what 
source data is selected from all or a subset of the 
source data containing plural titles within a defined time 
period, and how the selected source data is reassem- 
bled to reproduce the scenario (sequence) intended by 
the user. Addition, the editing instruction content in the 
DVD system contains the following information. Specifi- 
cally, the editing instructions further contain such infor- 
mation as: the number of streams contained in the 
editing units, which are obtained by splitting a multi-title 
source stream into blocks at a constant time interval; 
the number of audio and subpicture data cells contained 
in each stream, and the subpicture display time and 
period; whether the user content is selected from plural 
streams including, for example, parental lock control or 



multi-angle control; and the method of connecting 
scenes when the angle is switched among the multiple 
viewing angles. 

The scenario data St7 of the DVD encoder ECD 

s also contains control information on a video object VOB 
unit basis. This information is required to encode the 
media source stream, and specifically includes such 
information as whether there are multiple angles, and 
whether the title is a multi-rated title enabling parental 

10 lock control. When multiple angle viewing is enabled, 
the scenario data St7 also contains the stream encod- 
ing bit rate reflecting the disk capacity and data inter- 
leaving for multi-angle control or parental lock control, 
the start and end times of each control period, and 

is whether a seamless connection should be made 
between the preceding and following streams. 

The encoding system controller 200 extracts this 
information from the scenario data St7, and generates 
the encoding information table and encoding parame- 

20 ters required for encoding control. The encoding infor- 
mation table and encoding parameters are described 
with reference to Rg. 13, Rg. 14, and Fig. 15 below. It 
should be noted that these encoding information tables 
and encoding parameters are shown by way of example 

25 when the authoring encoding parameters are generated 
lor DVD seamless authoring. 

The stream encoding data St33 contains the sys- 
tem stream encoding parameters and system encoding 
start and end timing values required by the DVD system 

30 to generate the VOBs. These system stream encoding 
parameters include the conditions for connecting one 
video object VOB with those before and after, the 
number of audio streams, the audio encoding informa- 
tion and audio IDs, the number of subpictures and the 

35 subpicture IDs, the video playback starting time infor- 
mation VPTS, and the audio playback starting time 
information APTS. The title sequence control signal 
St39 supplies the multimedia bitstream MBS formatting 
start and end timing information and formatting param- 

40 eters declaring the reproduction control information and 
interleave information. 

Based on the video encoding parameter and 
encoding start/end timing signal St9, the video encoder 
300 encodes a specific part of the video stream St1 to 

45 generate an elementary stream conforming to the 
MPEG-2 Video standard defined in ISO-13818. This 
elementary stream is output to the video stream buffer 
400 as video encoding stream St15. 

Note that while the video encoder 300 generates an 

so elementary stream conforming to the MPEG-2 Video 
standard defined in ISO-13818, specific encoding 
parameters are input via the video encoding parameter 
signal St9, including the encoding start and end timing, 
bit rate, the encoding conditions for the encoding start 

55 and end, the material type, including whether the mate- 
rial is an NTSC or PAL video signal or telecine con- 
verted material, and whether the encoding mode is set 
for either open GOP or closed GOP encoding 
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The MPEG-2 coding method is basically an inter- 
frame coding method using the correlation between 
frames. That is, the frame being coded (the target 
frame) is coded by referencing the frames before and 
after the target frame. However, intra-coded frames, that 5 
is, frames that are coded based solely on the content of 
the target frame, are also inserted to avoid error propa- 
gation and enable accessibility from mid-stream (ran- 
dom access). The coding unit containing at least one 
intra-coded frame C , intra-frame B ) is called a GOP. 10 

A GOP in which coding is closed completely within 
that GOP is known as a "closed GOP." A GOP contain- 
ing a frame coded with reference to a frame in a preced- 
ing GOP is an "open GOP." It is therefore possible to 
reproduce a closed GOP using only that GOP. Repro- 15 
ducing an open GOP, however, also requires the pres- 
ence of the preceding GOP. 

The GOP is often used as the access unit. For 
example, the GOP may be used as the playback start 
point for reproducing a title from the middle, as a transi- 20 
tion point in a movie, or for fast-forward play and other 
special reproduction modes. High speed reproduction 
can be achieved in such cases by reproducing only the 
intra-frame coded frames in a GOP or by reproducing 
only frames in GOP units. 25 

Based on the subpicture stream encoding parame- 
ter signal St11, the subpicture encoder 500 encodes a 
specific part of the subpicture stream St3 to generate a 
variable length coded bitstream of bitmapped data. This 
variable length coded bitstream data is output as the 30 
subpicture encoding stream St17 to the subpicture 
stream buffer 600. 

Based on the audio encoding signal St1 3, the audio 
encoder 700 encodes a specific part of the audio 
stream St5 to generate the encoded audio data. This 35 
encoded audio data may be data based on the MPEG- 

1 audio standard defined in ISO-1 1 1 72 and the MPEG- 

2 audio standard defined in ISO-1 381 8, AC-3 audio 
data, or PCM (LPCM) data. Note that the methods and 
means of encoding audio data according to these 40 
standards are known and commonly available. 

The video stream buffer 400 is connected to the 
video encoder 300 and stores the video encoding 
stream St15 input from the video encoder 300. The 
video stream buffer 400 is also connected to the encod- 45 
ing system controller 200, and based on the timing sig- 
nal St21 outputs the stored video encoding stream Stl 5 
as the time-delayed video encoding stream St27. 

The subpicture stream buffer 600 is similarly con- 
nected to the subpicture encoder 500 and stores the so 
subpicture encoding stream St1 7 input from the subpic- 
ture encoder 500. The subpicture stream buffer 600 is 
also connected to the encoding system controller 200, 
and based on the timing signal St23 outputs the stored 
subpicture encoding stream Stl 7 as time-delayed sub- ss 
picture encoding stream St29. 

The audio stream buffer 800 is similarly connected 
to the audio encoder 700 and stores the audio encoding 



stream St19 input from the audio encoder 700. The 
audio stream buffer 800 is also connected to the encod- 
ing system controller 200, and based on the timing sig- 
nal St25 outputs the audio encoding stream St 19 as the 
time-delayed audio encoding stream St31. 

The system encoder 900 is connected to the video 
stream buffer 400, subpicture stream buffer 600, and 
audio stream buffer 800, and receives therefrom time- 
delayed video encoding stream St27, time<telayed sub- 
picture encoding stream St29, and time-delayed audio 
encoding stream St31. The system encoder 900 is also 
connected to the encoding system controller 200, and 
receives therefrom St33 containing the encoding 
parameter data for system encoding. Note that the sys- 
tem encoder 900 multiplexes the time-delayed streams 
St27, St29, and St3l based on the encoding parameter 
data and encoding start/stop timing signal St33 to gen- 
erate title editing units (VOBs) St35. 

The VOB buffer 1000 temporarily stores the video 
objects VOBs produced by the system encoder 900. 
The formatter 1100 reads the delayed video objects 
VOB from the VOB buffer 1000 based on the title 
sequence control signal St39 to generate one video 
zone VZ, and adds the file system VFS to generate 
St43. 

The stream St43 edited according to the user- 
defined scenario is then sent to the recorder 1200. The 
recorder 1200 processes the edited multimedia bit- 
stream MBS to data St43 in a format determined by the 
recording medium M, and then records it to the record- 
ing medium M. 

<3.3)DVD decoder 

<3.3.1)Multiscene 

The concept of multiple angle scene control accord- 
ing to the present invention is described below with ref- 
erence to Fig. 9. As described above, a multiscene 
period is constructed from basic scene periods contain- 
ing data common to each title, and multi-scene periods 
comprising groups of different scenes corresponding to 
the various scenario requests. In the figure, scenes 1 , 5, 
and 8 are the common scenes. The angle scenes 
between scenes 1 and 5, and the parental lock control 
scenes between scenes 5 and 8, are the mufti-scene 
periods. 

Scenes taken from different angles, that is, angles 
1 , 2, and 3 in this example, can be dynamically selected 
and reproduced during playback in the multi-angle 
scene period. In the parental lock control period, how- 
ever, only one of the available scenes, scenes 6 and 7, 
having different content can be selected, and must be 
selected statically before playback begins. 

Which of these scenes from the multi-scene peri- 
ods is to be selected and reproduced is defined by the 
user operating the scenario selector 2100, thereby gen- 
erating the scenario selection data St5l. In scenario 1 
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in the figure, the user can freely select any of the multi- 
angle scenes, and scene 6 has been preselected for 
output in the parental lock control scene period. Simi- 
larly in scenario 2, the user can freely select any of the 
multi-angle scenes, and scene 7 has been preselected 
for output in the parental lock control scene period. 

The program chain information VTS_PGCI in a 
DVD data structure for multi-scene control as shown in 
Fig. 9 is described next below with reference to Rg. 16 
and Fig. 17. 

The user-selected scenarios shown in Fig. 9 are 
shown in Fig. 1 6 using the notation of a VTSI data struc- 
ture representing the internal structure of a video title 
set in the DVD data structure shown in Rg. 5. 

More specifically in Fig. 16, scenario 1 and sce- 
nario 2 shown in Rg. 9 are shown as the two program 
chains VTS__PGC#1 and VTS_PGC#2 of the program 
chain information VTS_PGCIT in VTSI of Fig. 5. More 
specifically, VTS_PGC#1 describing scenario 1 con- 
sists of cell playback information C_PBI#1 correspond- 
ing to scene 1, cell playback information C_PBI#2, 
C_PBI#3, and C_PBI#4 within a multi-angle cell block 
corresponding to a multi-angle scene, cell playback 
information C_PBI#5 corresponding to scene 5, cell 
playback information C_PBI#6 corresponding to scene 

6, and cell playback information C_PBI#7 correspond- 
ing to scene 8. 

Likewise, VTS_PGGI#2 describing scenario 2 con- 
sists of cell playback information C_PBI#1 correspond- 
ing to scene 1, cell playback information C_PBI#2, 
C_PBI#3, and C_PBI#4 within a multi-angle cell block 
corresponding to a multi-angle scene, cell playback 
information C_PBl#5 corresponding to scene 5, cell 
playback information C_PBI#6 corresponding to scene 

7, and C_PBI#7 corresponding to scene 8. 

In the DVD data structure, a scene, which is one 
reproduction control unit for a scenario, is represented 
as a DVD data structure unit called a "cell," which is thus 
used to construct user-selectable scenarios in the DVD 
structure. 

The user-selected scenarios shown in Fig. 9 are 
shown in Rg. 1 7 using the notation of a VOB data struc- 
ture VTSTT_VOBS representing multimedia bitstream 
for a video title set in the DVD data structure shown in 
Fig. 5. 

As specifically shown in Rg. 17, the two scenarios 
1 and 2 use the VOB data for one title in common. Indi- 
vidual scenes common to each scenario are placed in 
non-interleaved blocks, that is, to contiguous blocks, as 
independent VOB units. In this figure, these are VOB#1 
corresponding to scene 1, VOB#5 corresponding to 
scene 5, and VOB#8 corresponding to scene 8. 

In the multi-angle scenes common to scenario 1 
and scenario 2, one VOB constitutes one angle, and the 
VOB are interleaved in an interleave block to enable 
seamless reproduction of each angle and switching 
between angles. In Fig. 17, VOB #2 represents angle 1, 
VOB #3 represents angle 2, and VOB #4 represents 



angle 3. 

Scene 6 and scene 7, that is, the scenes unique to 
scenario 1 and scenario 2, are also placed in an inter- 
leaved block to enable seamless reproduction of the 
5 scenes in each unique scenario, and to enable seam- 
less reproduction to the common scenes before and 
after. 

As described above, the user-requested scenario 
shown in Rg. 9 can be realized in a DVD data structure 
10 by utilizing the video title playback control information 
shown in Fig. 16, and the title playback VOB data struc- 
ture shown in Fig. 17. 

( 3.3.2 ) Seaml es s pla ybac k 

15 

The seamless playback capability* mentioned 
above with regard to the DVD system data structure is 
described below. Note that seamless playback refers to 
the reproduction in a digital video disk system of multi- 
20 media data including video, audio, and subpicture data 
without intermittent breaks in the data or information 
between basic scene periods, between basic scene 
periods and multi-scene periods, and between multi- 
scene periods. 

25 Hardware factors contributing to intermittent play- 
back of this data and title content include decoder 
underflow, that is, an imbalance between the source 
data input speed and the decoding speed of the input 
source data. 

30 Other factors relate to the properties of the play- 
back data. When the playback data is data that must be 
continuously reproduced for a constant time unit in 
order for the user to understand the content or informa- 
tion, e.g., audio data, data continuity is lost when the 

35 required continuous presentation time cannot be sus- 
tained. Reproduction of such information whereby the 
required continuity is sustained is referred to as "contin- 
uous information reproduction," or "seamless informa- 
tion reproduction." Reproduction of this information 

40 when the required continuity cannot be sustained is 
referred to as "non-continuous information reproduc- 
tion," or "non-seamless information reproduction." It is 
obvious that continuous information reproduction and 
non-continuous information reproduction are, respec- 

45 tively, seamless and non-seamless reproduction. 

Note that seamless reproduction can be further cat- 
egorized as seamless data reproduction and seamless 
information reproduction. Seamless data reproduction 
is defined as preventing physical blanks or interruptions 

so in the data playback (intermittent reproduction) as a 
result of a buffer underflow, for example,. Seamless 
information reproduction is defined as preventing appar- 
ent interruptions in the information as perceived by the 
user (intermittent presentation) during intellection of 

55 information from the playback data where there are no 
actual physical breaks in the actual reproduction of 
data. 
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( 3.3.3 )Petails of seamless playback 

A specific method enabling seamless reproduction 
as thus described is described later below with refer- 
ence to Fig. 10 and Fig. 11. 

(3.4.1 ) Interleaving 

The DVD data system streams described above are 
recorded using an appropriate authoring encoder EC as 
a movie or other multimedia title on a DVD recording 
medium. 

Supplying a single movie in a format enabling the 
movie to be used in a plurality of different cultural 
regions or countries requires the script to be recorded in 
the various languages used in those regions or coun- 
tries. It may even necessitate editing the content to con- 
form to the mores and moral expectations of different 
cultures. Even using such a large-capacity storage sys- 
tem as the DVD system, however, it is necessary to 
reduce the bit rate, and therefore the image quality, if a 
plurality of full-length titles edited from a single common 
source title are recorded to a single disk. This problem 
can be solved by recording the common parts of plural 
titles only once, and recording the segments different in 
each title for each different title only. This method makes 
it possible to record a plurality of titles for different coun- 
tries or cultures to a single optica) disk without reducing 
the bit rate, and, therefore, retaining high image quality. 

As shown in Rg. 9, the titles recorded to a single 
optica) disk contain basic scene periods of scenes com- 
mon to all scenarios, and multi-scene periods contain- 
ing scenes specific to certain scenarios, to provide 
parental lock control and multi-angle scene control func- 
tions. 

In the case of the parental lock control function, 
titles containing sex scenes, violent scenes, or other 
scenes deemed unsuitable for children, that is, so- 
called "adult scenes," are recorded with a combination 
of common scenes, adult scenes, and children's 
scenes. These title streams are achieved by arraying 
the adult and children's scenes to mufti-scene periods 
between the common basic scene periods. 

Mufti-angle control can be achieved in a conven- 
tional single-angle title by recording plural muftimetfa 
scenes obtained by recording the subjects from the 
desired plural camera angles to the mufti-scene periods 
arrayed between the common basic scene periods. 
Note, however, that while these plural scenes are 
described here as scenes recorded from different cam- 
era angles (positions), it will be obvious that the scenes 
may be recorded from the same camera angle but at dif- 
ferent times, data generated by computer graphics, or 
other video data. 

When data is shared between different scenarios of 
a single title, it is obviously necessary to move the laser 
beam LS from the common scene data to the non-com- 
mon scene data during reproduction, that is, to move 



the optical pickup to a different position on the DVD 
recording medium RC1. The problem here is that the 
time required to move the optical pickup makes it diffi- 
cult to continue reproduction without creating breaks in 

5 the audio or video, that is, to sustain seamless repro- 
duction. This problem can be theoretically solved by 
providing a track buffer (stream buffer 2400) to delay 
data output an amount equivalent to the worst access 
time. In general, data recorded to an optical disk is read 

w by the optical pickup, appropriately processed, and tem- 
porarily stored to a track buffer. The stored data is sub- 
sequently decoded and reproduced as video or audio 
data. 

is (3.4.2) Definition of interleaving 

To thus enable the user to selectively cut scenes 
and choose from among a plurality of scenes, non- 
selected scene data is necessarily recorded inserted 

20 between common scene data and selective scene data 
because the data units associated with individual 
scenes are contiguously recorded to the recording 
tracks of the recording medium, ft data is then read in 
the recorded sequence, non-selected scene data must 

25 be accessed before accessing and decoding the 
selected scene data, and seamless connection of the 
selected scenes is difficult The excellent random 
access characteristics of the DVD system, however, 
make seamless connection between a plurality of 

30 scenes possible. 

In other words, by splitting scene-specific data into 
a plurality of units of a specified data size, and interleav- 
ing a plurality of split data units for different scenes in a 
predefined sequence within the jumping range whereby 

35 a data underflow state does not occur, it is possible to 
reproduce the selected scenes without data interruption 
by intermittently accessing and decoding the data spe- 
cific to the selected scenes using these split data units. 
In other words, seamless data reproduction can be sus- 

40 tained. 

( 3.4.3 ) Structure of interleave blocks and units 

An interleaving method enabling seamless data 
45 reproduction according to the present invention is 
described below with reference to Fig. 1 1 and Fig. 18. 
Shown in Fig. 1 1 is a case from which reproduction can 
branch from one video object VOB-A to one of a plural- 
it/ of video objects VOB-B, VOB-C. and VOB-D, and 
so then merge back again to a single video object VOB-E. 
The actual arrangement of these blocks recorded to a 
data recording track TR on disk is shown in Fig. 18. 

Referring to Fig. 18, VOB-A and VOB-E are video 
objects with independent playback start and end times, 
55 and are in principle arrayed to contiguous block regions. 
As shown in Rg. 1 1 , the playback start and end times of 
VOB-B, VOB-C, and VOB-D are aligned during inter- 
leaving. The interleaved data blocks are then recorded 
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to a contiguous region of the disk as an interleaved 
block region. The contiguous block regions and inter- 
leaved block regions are then written to disk in the track 
path Dr direction in the playback sequence. A plurality of 
video objects VOB, that is, VOBS, arrayed to the data 5 
recording track TR are shown in Fig. 18. 

Referring to Fig. 18, data regions to which data is 
continuously arrayed are called "blocks," of which there 
are two types: "contiguous block regions" in which VOB 
with discrete starting and end points are contiguously 10 
arrayed, and "interleaved block regions" in which plural 
VOB with aligned starting and end points are inter- 
leaved. The respective blocks are arrayed as shown in 
Fig. 34 in the playback sequence, that is, block 1 , block 
2, block 3, ...block 7. 15 

As shown in Fig. 34, a VTS title VOBS 
(VTSTTVOBS) consist of blocks 1 , 2, 3, 4, 5, 6, and 7. 
Block 1 contains VOB 1 alone. Blocks 2, 3, 5, and 7 sim- 
ilarly discretely contain VOBS 2, 3, 6. and 10. Blocks 2, 
3, 5, and 7 are thus contiguous block regions. 20 

Block 4, however, contains VOB 4 and VOB 5 inter- 
leaved together, while block 6 contains three VOB, VOB 
7, VOB 8. and VOB 9. interleaved together. Blocks 4 
and 6 are thus interleaved block regions. 

The internal data structure of the contiguous block 25 
regions is shown in Fig. 35 with VOB-i and VOB-j 
arrayed as the contiguous blocks in the VOBS. As 
described with reference to Fig. 5, VOB-i and VOB-j in a 
contiguous block region are further logically divided into 
cells as the playback unit. Both VOB-i and VOB-j in this 30 
figure are shown comprising three cells CELL #1 , CELL 
#2, and CELL #3. 

Each cell comprises one or more video object unit 
VOBU with the video object unit VOBU defining the 
boundaries of the cell. Each cell also contains informa- 35 
tion identifying the position of the cell in the program 
chain (PGC below) as shown in Fig. 5. More specifically, 
this position information is the address of the first and 
last VOBU in the cell. As also shown in Fig. 35, these 
VOB and the cells defined therein are also recorded to 40 
a contiguous block region so that contiguous blocks are 
contiguously reproduced. Reproducing these contigu- 
ous blocks is therefore no problem. 

The internal data structure of the interleaved block 
regions is shown in Fig. 36. In the interleaved block 45 
regions each video object VOB is divided into inter- 
leaved units ILVU, and the interleaved units ILVU asso- 
ciated with each VOB are alternately arrayed. Cell 
boundaries are defined independently of the interleaved 
units ILVU. For example, VOB-k is divided into four inter- so 
leaved units ILVUkl, ILVUk2, ILVUk3, and ILVUk4, and 
two cells CELL #1k and CELL #2k are defined. VOB-m 
is likewise divided into four interleaved units ILVUml, 
ILVUm2, ILVUm3, and ILVUm4, and two cells CELL 
#1 m and CELL #2m are defined. The interleaved units ss 
ILVU thus contains both audio and video data. 

In the example shown in Fig. 36, the interleaved 
units ILVUkl, ILVUk2, ILVUk3, and ILVUk4, and 



ILVUml, ILVUm2, ILVUm3, and ILVUm4, from two dif- 
ferent video objects VOB-k and VOB-m are alternately 
arrayed within a single interleaved block. By interleaving 
the interleaved units ILVU of two video objects VOB in 
this sequence, it is possible to achieve seamless repro- 
duction branching from one scene to one of a plurality of 
scenes, and from one of a plurality of scenes to one 
scene. This interleaving process further enables seam- 
less reproduction of scenes from various scenario 
threads in most cases. 

( 3.5. 1 ) Muftiscene control 

The concept of multiple angle scene control and 
multiscene periods according to the present invention 
are described below. 

An example of a scene recorded from a plurality of 
different angles has been given above. However, the 
scenes in a multiscene period can alternatively be 
scenes recorded from the same angle but at a different 
time, or data such as computer graphics. In other words, 
a multi-angle scene period is a multi-scene period. 

< 3.5.2 ) Parental control 

The concept of recording plural titles comprising 
alternative scenes for such functions as parental lock 
control and recording a director's cut is described below 
with reference to Fig. 4. 

An example of a multi-rated title stream providing 
for parental lock control is shown in Fig. 4. When so- 
called "adult scenes" containing sex, violence, or other 
scenes deemed unsuitable for children are contained in 
a title implementing parental lock control, the title 
stream is recorded with a combination of common sys- 
tem streams SSa, SSb, and SSe, an adult-oriented sys- 
tem stream SSc containing the adult scenes, and a 
child-oriented system stream SSd containing only the 
scenes suitable for children. Title streams such as this 
are recorded as a multi-scene system stream contain- 
ing the adult-oriented system stream SSc and the child- 
oriented system stream SSd arrayed to the multi-scene 
period between common system streams SSb and SSe. 

The relationship between each of the component 
titles and the system stream recorded to the program 
chain PGC of a title stream thus comprised is described 
below. 

The adult-oriented title program chain PGC1 com- 
prises in sequence the common system streams SSa 
and SSb, the adult-oriented system stream SSc, and 
the common system stream SSe. The child-oriented 
title program chain PGC2 comprises in sequence the 
common system streams SSa and SSb, the child-ori- 
ented system stream SSd. and the common system 
stream SSe. 

By thus arraying the adult-oriented system stream 
SSc and child-oriented system stream SSd to a multi- 
scene period, the decoding method previously 
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described can reproduce the title containing adult-ori- 
ented content by reproducing the common system 
streams SSa and SSb. then selecting and reproducing 
the adult-oriented system stream SSc, and then repro- 
ducing the common system stream SSe as instructed 
by the adult-oriented title program chain PGC notation. 
In addition, a child-oriented title from which the adult- 
oriented scenes have been expurgated can be repro- 
duced by selecting the child-oriented system stream 
SSd in the multi-scene period. 

This method of providing in the title stream a multi- 
scene period containing plural alternative scenes, 
selecting which of the scenes in the multi-scene period 
are to be reproduced before playback begins, and gen- 
erating plural titles containing essentially the same title 
content but different scenes in part is called parental 
lock control. 

Note that parental lock control is so named 
because of the intent to protect children from undesira- 
ble content. From the perspective of system stream 
processing, however, parental lock control is a technol- 
ogy for statically generating different title streams by 
means of the user pre-selecting specific scenes from a 
multi-scene period. Note, further, that this contrasts with 
multi-angle scene control, which is a technology for 
dynamically changing the content of a single title by 
means of the user selecting scenes from the multi- 
scene period freely and in real-time during title play- 
back. 

Title stream editing enabling what is known as the 
"director's cut" can also be achieved using this parental 
lock control technology. The director's cut refers to the 
process of editing certain scenes from a movie to, for 
example, shorten the total presentation time. This may 
be necessary, for example, to edit a feature-length 
movie for viewing on an airplane where the presentation 
time is too long for viewing within the flight time. The 
movie director or producer thus determines which 
scenes may be cut to shorten the movie. The title can 
then be recorded with both a full-length, unedited sys- 
tem stream and a director-controlled edited system 
stream in which the edited scenes are recorded to multi- 
scene periods. At the transition from one system stream 
to another system stream in such applications, parental 
lock control must be able to maintain smooth playback 
image output More specifically, seamless data repro- 
duction whereby a data underflow state does not occur 
in the audio, video, or other buffers, and seamless infor- 
mation reproduction whereby no unnatural interruptions 
are audibly or visibly perceived in the audio and video 
playback, are necessary. 

< 3.5.3 )Murrj-anole control 

The concept of multi-angle scene control in the 
present invention is described next with reference to 
Fig. 19. In general, multimedia titles are obtained by 
recording both the audio and video information (collec- 



tively "recording" below) of the subject over time T. The 
blocks #SC1 , #SM1 , #SM2, #SM3, and #SC3 represent 
the multimedia scenes obtained at recording unit times 
T1, T2, and T3 by recording the subject at respective 
5 camera angles. Scenes #SM1, #SM2, and #SM3 are 
recorded at mutually different (first second, and third) 
camera angles during recording unit time T2, and are 
referenced below as the first, second, and third multi- 
angle scenes. 

w Note that the multi-scene periods referenced herein 
are assumed to comprise scenes recorded from differ- 
ent angles. The scenes may, however, be recorded from 
the same angle but at different times, or they may be 
computer graphics data or other data. The multi-angle 

15 scene periods are thus multi-scene periods from which 
a plurality of scenes can be selected for presentation in 
the same time period, whether or not the scenes are 
actually recorded at different camera angles. 

Scenes #SC1 and #SC3 are scenes recorded at 

20 the same common camera angle during recording unit 
times T1 and T3, that is, before and after the multi-angle 
scenes. These scenes are therefore called "common 
angle scenes." Note that one of the multiple camera 
angles used in the mufti-angle scenes is usually the 

25 same as the common camera angle. 

To understand the relationship between these vari- 
ous angle scenes, multi-angle scene control is 
described below using by way of example only a live 
broadcast of a baseball game. 

30 The common angle scenes #SC1 and #SC3 are 
recorded at the common camera angle, which is here 
defined as the view from center field on the axis through 
the pitcher, batter, and catcher. 

The first angled scene #SM1 is recorded at a first 

35 multi-camera angle, that is, the camera angle from the 
backstop on the axis through the catcher, pitcher, and 
batter. The second angled scene #SM2 is recorded at a 
second multi-camera angle, that is, the view from center 
field on the axis through the pitcher, batter, and catcher. 

40 Note that the second angled scene #SM2 is thus the 
same as the common camera angle in this example. It 
therefore follows that the second angled scene #SM2 is 
the same as the common angle scene #SC2 recorded 
during recording unit time T2. The third angled scene 

45 #SM3 is recorded at a third multi-camera angle, that is, 
the camera angle from the backstop focusing on the 
infield. 

The presentation times of the multiple angle scenes 
#SM1, #SM2, and #SM3 overlap in recording unit time 

so T2; this period is called the "multi-angle scene period." 
By freely selecting one of the multiple angle scenes 
#SM1. #SM2, and #SM3 in this multi-angle scene 
period, the viewer is able to change his or her virtual 
viewing position to enjoy a different view of the game as 

55 though the actual camera angle is changed. Note that 
while there appears to be a time gap between common 
angle scenes #SC1 and #SC3 and the multiple angle 
scenes #SM1, #SM2. and #SM3 in the figure, this is 
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simply to facilitate the use of arrows in the figure for eas- 
ier description of the data reproduction paths repro- 
duced by selecting different angled scenes. That there 
is no actual time gap during playback will be obvious. 

Multi-angle scene control of the system stream s 
based on the present invention is described next with 
reference to Fig. 10 from the perspective of connecting 
data blocks. The multimedia data corresponding to 
common angle scene #SC is referenced as common 
angle data BA, and the common angle data BA in w 
recording unit times T1 and 73 are referenced as BA1 
and BA3, respectively. The multimedia data corre- 
sponding to the multi-angle scenes #SM1, #SM2, and 
#SM3 are referenced as first, second, and third angle 
scene data MA1, MA2, and MA3. As previously 15 
described with reference to Fig. 19, scenes from the 
desired angled can be viewed by selecting one of the 
multiple angle data units MA1 , MA2, and MA3. There is 
also no time gap between the common angle data BA1 
and BA3 and the multiple angle data units MA1 , MA2, 20 
and MA3. 

In the case of an MPEG system stream, however, 
intermittent breaks in the playback information can 
result between the reproduced common and multiple 
angle data units depending upon the content of the data 2s 
at the connection between the selected multiple angle 
data unit MA1, MA2, and MA3 and the first common 
angle data BA1 before the angle selected in the multi- 
angle scene period or the common angle data BA3 fol- 
lowing the selected angle. The result in this case is that 30 
the title stream is not naturally reproduced as a single 
contiguous title, that is, seamless data reproduction is 
achieved but non-seamless information reproduction 
results. 

The multi-angle selection process whereby one of 35 
plural scenes is selectively reproduced from the multi- 
angle scene period with seamless presentation of infor- 
mation before and after scene changes is described 
below with application in a DVD system using Fig. 10. 

Changing the scene angle, that is, selecting one of <o 
the multiple angle data units MA1 , MA2, and MA3, must 
be completed before reproduction of the preceding 
common angle data BA1 is completed. It is extremely 
difficult, for example, to change to a different angle data 
unit MA2 during reproduction of common angle data 45 
BA1 . This is because the multimedia data has a variable 
length coded MPEG data structure, which makes it diffi- 
cult to find the data break points (boundaries) in the 
selected data blocks. The video may also be disrupted 
when the angle is changed because inter-frame correla- so 
tions are used in the coding process. The GOP process- 
ing unit of the MPEG standard contains at least one 
refresh frame, and closed processing not referencing 
frames belonging to another GOP is possible within this 
GOP processing unit. ss 

In other words, if the desired angle data, for exam- 
ple, MA3, is selected before reproduction reaches the 
multi-angle scene period, and at the latest by the time 



reproduction of the preceding common angle data BA1 
is completed, the angle data selected from within the 
multi-angle scene period can be seamlessly repro- 
duced. However, it is extremely difficult while reproduc- 
ing one angle to select and seamlessly reproduce 
another angle within the same multi-angle scene period. 
It is therefore difficult when in a multi-angle scene 
period to dynamically select a different angle unit pre- 
senting, for example, a view from a different camera 
angle. 

(3.6.1 )Flow chart: encoder 

The encoding information table generated by the 
encoding system controller 200 based on the scenario 
data St7 is described below referring to Fig. 1 13. 

The encoding information table contains VOB set 
data streams containing a plurality of VOB correspond- 
ing to the scene periods beginning and ending at the 
scene branching and connecting points, and VOB data 
streams corresponding to each scene. The VOB set 
data streams shown in Fig. 13 are described in detail 
below. 

Thee VOB set data streams shown in Fig. 13 are 
the encoding information tables generated at step #100 
in Fig. 20 by the encoding system controller 200 for cre- 
ating the DVD multimedia stream based on the user- 
defined title content. 

The user-defined scenario contains branching 
points from common scenes to plural scenes, or con- 
nection points to other common scenes. The VwOB cor- 
responding to the scene period delimited by these 
branching and connecting points is a VOB set, and the 
data generated to encode a VOB set is the VOB set 
data stream. When a muitiscene period is contained in 
a VOB set data stream, the title number specified by the 
VOB set data stream is the title number TITLE_NO of 
the VOB set data stream. 

The VOB Set data structure in Fig. 13 shows the 
data content for encoding one VOB set in the VOB set 
data stream. The VOB Set data structure comprises: 
the VOB set number VOBS_NO, the VOB number 
VOBJMO in the VOB set, the preceding VOB seamless 
connection flag VOB_Fsb, the following VOB seamless 
connection flag VOB_Fsf. the multi-scene flag VOB_Fp, 
the interleave flag VOB_Fi, the multi-angle flag 
VOB_Fm, the multi-angle seamless switching flag 
VOB_FsV, the maximum bit rate of the interleaved VOB 
ILV_BR the number of interleaved VOB divisions 
ILVJD1V, and the minimum interleaved unit presentation 
time ILVU_MT. 

The VOB set number VOBS_NO is a number iden- 
tifying the VOB set, and is assigned based on the repro- 
duction sequence of the title scenario. 

The VOB number VOBjsiO is a number identifying 
the VOB, and is assigned, for example, throughout a 
title scenario based on the reproduction sequence of 
the title scenario. 
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The preceding VOB seamless connection flag 
VOB_Fsb indicates whether a seamless connection 
with the preceding VOB is required for scenario repro- 
duction. 

The following VOB seamless connection flag 
VOB_Fsf indicates whether there is a seamless con- 
nection with the following VOB during scenario repro- 
duction. 

The multi-scene flag VOB_Fp identifies whether the 
VOB set comprises plural video objects VOB. 

The interleave flag VOB_R identifies whether the 
VOB in the VOB set are interleaved. 

The multi-angle flag VOB_Fm identifies whether the 
VOB set is a multi-angle set. 

The multi-angle seamless switching flag VOB_FsV 
identifies whether angle changes within the multi-angle 
scene period are seamless or not 

The maximum bit rate of the interleaved VOB 
ILV_BR defines the maximum bit rate of the interleaved 
VOBs. 

The number of interleaved VOB divisions ILV_DIV 
identifies the number of interleave units in the inter- 
leaved VOB. 

The minimum interleave unit presentation time 
ILVU_MT defines the time that can be reproduced when 
the bit rate of the smallest interleave unit at which a 
track buffer data underflow state does not occur is the 
maximum bit rate of the interleaved VOB ILV_BR during 
interleaved block reproduction. 

The encoding information table for each VOB gen- 
erated by the encoding system controller 200 based on 
the scenario data St7 is described below referring to 
Fig. 14. The VOB encoding parameters described 
below and supplied to the video encoder 300, subpic- 
ture encoder 500, audio encoder 700, and system 
encoder 900 for stream encoding are produced based 
on this encoding information table. 

The VOB data streams shown in Fig. 14 are the 
encoding information tables generated at step #100 in 
Fig. 20 by the encoding system controller 200 for creat- 
ing the DVD multimedia stream based on the user- 
defined title content 

Trie encoding unit is the video object VOB, and the 
data generated to encode each video object VOB is the 
VOB data stream. For example, a VOB set comprising 
three angle scenes comprises three video objects VOB. 
The data structure shown in Fig. 14 shows the content 
of the data for encoding one VOB in the VOB data 
stream. 

The VOB data structure contains the video material 
start time VOB_VST, the video material end time 
VOB_VEND, the video signal type VOB_V_KIND, the 
video encoding bit rate V_BR, the audio material start 
time VOB_AST, the audio material end time 
VOB_AEND, the audio coding method VOBJW<IND, 
and the audio encoding bit rate A_BR. 

The video material start time VOB_VST is the video 
encoding start time corresponding to the time of the 



video signal. 

The video material end time VOB_VEND is the 
video encoding end time corresponding to the time of 
the video signal. 
5 The video material type VOB_VJ<IND identifies 
whether the encoded material is in the NTSC or PAL for- 
mat for example, or is telecine-converted video mate- 
rial. 

The video encoding bit rate V_BR is the bit rate at 
w which the video signal is encoded. 

The audio material start time VOB_AST is the 
audio encoding start time corresponding to the time of 
the audio signal. 

The audio material end time VOB_AEND is the 
15 audio encoding end time corresponding to the time of 
the audio signal 

The audio coding method VOB_A_KIIMD identifies 
the audio encoding method as AC-3, MPEG, or linear 
PCM, for example. 
20 The audio encoding bit rate A_BR is the bit rate at 
which the audio signal is encoded. 

The encoding parameters supplied to the video, 
audio, and system encoders 300, 500 and 900 for VOB 
encoding are shown in Fig. 15. The encoding parame- 
25 ters include: the VOB number VOB_NO, video encoding 
start time V_STTM, video encoding end time 
V_ENDTM, the video encoding mode V_ENCMD, the 
video encoding bit rate V_RATE, the maximum video 
encoding bit rate V_MRATE, the GOP structure fixing 
30 flag GOP^FXflag, the video encoding GOP structure 
GOPST, the initial video encoding data VJNTST, the 
last video encoding data V_ENDST, the audio encoding 
start lime A_STTM, the audio encoding end time 
A_ENDTM, the audio encoding bit rate A_RATE, the 
35 audio encoding method AJENCMD, the audio start gap 
A_STGAP, the audio end gap A_ENDGAP, the preced- 
ing VOB number B_VOB_NO, and the following VOB 
number F_VOB_NO. 

The VOB number VOB_NO is a number identifying 
40 the VOB, and is assigned throughout the title scenario 
based on the position of the VOB in the reproduction 
sequence of the title scenario. 

The video encoding start time V_STTM is the start 
time of video material encoding. 
45 The video encoding end time V_STTM is the end 
time of video material encoding. 

The video encoding mode V_ENCMD is an encod- 
ing mode for declaring whether reverse telecine conver- 
sion shall be accomplished during video encoding to 
so enable efficient coding when the video material is tele- 
cine-converted material. 

The video encoding bit rate V_RATE is the average 
bit rate of video encoding. 

The maximum video encoding bit rate V_MRATE is 
55 the maximum bit rate of video encoding. 

The GOP structure fixing flag GOP_FXflag speci- 
fies whether encoding is accomplished without chang- 
ing the GOP structure in the middle of the video 
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encoding process. This is a useful parameter for ena- 
bling seamless scene changing in a multi-angle scene 
period. 

The video encoding GOP structure GOPST is the 
GOP structure data from encoding. 

The initial video encoding data VJNST sets the ini- 
tial value of the VBV buffer (decoding buffer) at the start 
of video encoding. This is a useful parameter for declar- 
ing seamless reproduction with the preceding video 
encoding stream. 

The last video encoding data V_ENDST sets the 
end value of the VBV buffer (decoding buffer) at the end 
of video encoding. This is a useful parameter for declar- 
ing seamless reproduction with the following video 
encoding stream. 

The audio encoding start time AJ5TTM is the start 
time of audio encoding. 

The audio encoding end time A_ENDTM is the end 
time of audio encoding. 

The audio encoding bit rate A_RATE is the bit rate 
used for audio encoding. 

The audio encoding method A_ENCMD identifies 
the audio encoding method as AC-3, MPEG, or linear 
PCM, for example. 

The audio start gap A_STGAP is the time offset 
between the start of the audio and video presentation at 
the beginning of a VOB. This is a useful parameter for 
declaring seamless reproduction with the preceding 
system encoding stream. 

The audio end gap A_ENDGAP is the time offset 
between the end of the audio and video presentation at 
the end of a VOB. This is a useful parameter for declar- 
ing seamless reproduction with the following system 
encoding stream. 

The preceding VOB number B_VOB_NO is the 
VOB_NO of the preceding VOB when there is a seam- 
lessly connected preceding VOB. 

The following VOB number F_VOB_NO is the 
VOB_NO of the following VOB when there is a seam- 
lessly connected following VOB. 

The operation of a DVD encoder ECD according to 
the present invention is described below with reference 
to the flow chart in Fig. 20 and Fig. 21. Note that the 
steps shown with a double line are subroutines. It 
should be obvious that while the present embodiment is 
described below with reference to a DVD system, the 
described operation also applies to an authoring 
encoder EC. 

At step #100, the user inputs the editing commands 
using the editing information generator 100 according to 
the desired scenario while confirming the content of the 
multimedia source data streams St1 , St2, and St3. 

At step #200, the scenario editor 100 generates the 
scenario data St7 containing the above edit command 
information according to the user's editing instructions. 

When generating the scenario data St7 in step 
#200, the user editing commands related to multi-angle 
and parental lock multi-scene periods in which inter- 



leaving is presumed must be input to satisfy the follow- 
ing conditions. 

First the VOB maximum bit rate must be set to 
assure sufficient image quality, and the track buffer 

5 capacity, jump performance, jump time, and jump dis- 
tance of the DVD decoder DCD assumed to be used as 
the reproduction apparatus of the DVD encoded data 
must be determined. Based on these values, the repro- 
duction time of the shortest interleaved unit is obtained 

10 from equations 3 and 4. 

Based on the reproduction time of each scene in 
the multi-scene period, it must then be determined 
whether equations 5 and 6 are satisfied. If equations 5 
and 6 are not satisfied, the user must change the edit 

is commands until equations 5 and 6 are satisfied by, for 
example, connecting part of the following scene to each 
scene in the multi-scene period. 

When multi-angle edit commands are used, equa- 
tion 7 must be satisfied for seamless switching, and edit 

20 commands matching the audio reproduction time with 
the reproduction lime of each scene in each angle must 
be entered. If non-seamless switching is used, the user 
must enter commands to satisfy equation 8. 

At step #300, the encoding system controller 200 

25 first determines whether the target scene is to be seam- 
lessly connected to the preceding scene based on the 
scenario data St7. 

A seamless connection refers to seamlessly con- 
necting the common scene that is the presently 

30 selected target scene with any one scene selected from 
all scenes contained in a preceding multi-scene period 
when the preceding scene period is a multi-scene 
period comprising a plurality of scenes. Likewise, when 
the presently selected target scene is a multi-scene 

35 period, a seamless connection still refers to seamlessly 
connecting the target scene with any one of the scenes 
from the multi-scene period. 

If step #300 returns NO, that is, the connection is 
non-seamless, the procedure moves to step #400. 

40 At step #400, the encoding system controller 200 
resets the preceding VOB seamless connection flag 
VOB_Fsb indicating whether there is a seamless con- 
nection between the target and preceding scenes. The 
procedure then moves to step #600. 

45 On the other hand, if step #300 returns YES, that is, 
there is a seamless connection to the preceding scene, 
the procedure moves to step #500. 

At step #500 the preceding VOB seamless connec- 
tion flag VOB__Fsb is set. The procedure then moves to 

so step #600. 

At step #600 the encoding system controller 200 
determines whether there is a seamless connection 
between the target and following scenes based on sce- 
nario data St7. If step #600 returns NO, that is, the con- 

55 nection is non-seamless, the procedure moves to step 
#700. 

At step #700, the encoding system controller 200 
resets the following VOB seamless connection flag 
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VOB_Fsf indicating whether there is a seamless con- 
nection with the following scene. The procedure then 
moves to step #900. 

However, if step #600 returns YES. that is, there is 
a seamless connection to the following scene, the pro- s 
cedure moves to step #800. 

At step #800 the encoding system controller 200 
sets the following VOB seamless connection flag 
VOB_Fsf. The procedure then moves to step #900. 

At step #900 the encoding system controller 200 10 
determines whether there is more than one connection 
target scene, that is, whether a multi-scene period is 
selected, based on the scenario data St7. As previously 
described, there are two possible control methods in 
multi-scene periods: parental lock control whereby only is 
one of plural possible reproduction paths that can be 
constructed from the scenes in the multi-scene period is 
reproduced, and multi-angle control whereby the repro- 
duction path can be switched within the multi-scene 
period to present different viewing angles. 20 

If step #900 returns NO, that is, the connection is 
not between multiple scenes, the procedure moves to 
step #1000. 

At step #1000 the multi-scene flag VOB_Fp identi- 
fying whether a multi-scene period is selected is reset, 25 
and the procedure moves to step #1800 for encoding 
parameter generation. The operation of step #1800 is 
described below. 

However, if step #900 returns YES, that is, there is 
a multi-scene connection, the procedure moves to step 30 
#1100. 

At step #1 100, the multi-scene flag VOB_Fp is set, 
and the procedure moves to step #1200 where a multi- 
angle connection is evaluated. 

At step #1200 it is determined whether a change 35 
can be made between plural scenes in the multi-scene 
period, that is, whether a multi-angle scene period is 
selected. If step #1200 returns NO, that is, no scene 
change is allowed in the multi-scene period as parental 
lock control reproducing only one reproduction path has <o 
been selected, the procedure moves to step #1300. 

At step #1300 the multi-angle flag VOB_Fm identi- 
fying whether the target connection scene is a multi- 
angle scene is reset and the procedure moves to step 
#1302. 45 

At step #1302 it is determined whether either the 
preceding VOB seamless connection flag VOB_Fsb or 
following VOB seamless connection flag VOB_Fsf is 
set If step #1302 returns YES, that is, the target con- 
nection scene seamlessly connects to the preceding, so 
the following, or both the preceding and following 
scenes, the procedure moves to step #1304. 

At step #1304 the interleave flag VOB_Fi identifying 
whether the VOB, the encoded data of the target scene, 
is interleaved is set The procedure then moves to step ss 
#1800. 

However, if step #1302 returns NO. that is, the tar- 
get connection scene does not seamlessly connect to 



the preceding or following scene, the procedure moves 
to step #1306. 

At step #1306 the interleave flag VOB_Fi is reset, 
and the procedure moves to step #1800. 

If step #1200 returns YES, however, that is, there is 
a multi-angle connection, the procedure moves to step 
#1400. 

At step #1400, the multi-angle flag VOB_Fm and 
interleave flag VOB_Fi are set and the procedure 
moves to step #1500. 

At step #1500 the encoding system controller 200 
determines whether the audio and video can be seam- 
lessly switched in a multi-angle scene period, that is, at 
a reproduction unit smaller than the VOB, based on the 
scenario data St7. If step #1500 returns NO. that is, 
non-seamless switching occurs, the procedure moves 
to step #1600. 

At step #1600 the multi-angle seamless switching 
flag VOB_FsV indicating whether angle changes within 
the multi-angle scene period are seamless or not is 
reset, and the procedure moves to step #1800. 

However, if step #1500 returns YES, that is, seam- 
less switching occurs, the procedure moves to step 
#1700. 

At step #1700 the multi-angle seamless switching 
flag VOB_FsV is set, and the procedure moves to step 
#1800. 

Therefore, as thus described, the procedure 
advances to step #1800 after the editing information is 
detected from the above flag settings in the scenario 
data St7 reflecting the user-defined editing instructions. 

In step #1800, based on the user-defined editing 
instructions detected from the above flag settings, infor- 
mation is added to the encoding information tables for 
the VOB Set units and VOB units shown in Fig. 13 and 
Fig. 14 for source stream encoding, and the encoding 
parameters of the VOB data units shown in Fig. 15 are 
produced. The procedure then moves to step #1900. 

The encoding parameter generation step is 
described in greater detail below referring to Fig. 22, 
Fig. 23. Fig. 24, and Fig. 25. 

Based on the encoding parameters produced in 
step #1800, the video data and audio data are encoded 
in step #1900, and the procedure moves to step #2000. 

Note that the subpicture data is normally inserted 
during video reproduction on an as-needed basis, and 
contiguity with the preceding and following scenes is 
therefore not usually necessary. Moreover, the subpic- 
ture data is normally video information for one frame, 
and unlike audio and video data having an extended 
time-base, subpicture data is usually static, and is not 
normally presented for a sustained period. Therefore, 
because the present invention relates specifically to 
seamless and non-seamless contiguous reproduction 
as described above, description of subpicture data 
encoding is omitted herein for simplicity. 

Step #2000 causes the loop comprising step #300 
to step #1900 to be repealed as many limes as there are 
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VOB Sets. This loop formats the program chain 
VTS_PGC#i to contain the reproduction sequence and 
other reproduction information for each VOB in the title 
(Fig. 5) in the program chain data structure, interleaves 
the VOBs in the multi-scene periods, and completes the s 
VOB Set data stream and VOB data stream needed for 
system stream encoding. The procedure then moves to 
step #2100. 

At step #2100, the total number of VOB Sets 
VOBS_NUM obtained as a result of the loop through 10 
step #2000 is added to the VOB Set data stream, the 
number of titles TITLE_NO defining the number of sce- 
nario reproduction paths as the number of titles in the 
scenario data St7 is set, and the VOB Set data stream 
is completed as the encoding information table. The 15 
procedure then moves to step #2200. 

System stream encoding producing the VOB 
(VOB#i) data in the VTS title VOBS (VTSTTVOBS) 
(Fig. 5) is accomplished in step #2200 based on the 
encoded video stream and encoded audio stream out- 20 
put from step #1900, and the encoding parameters in 
Fig. 15. The procedure then moves to step #2300. 

At step #2300, the VTS information, VTSI manage- 
ment table VTSLMAT, VTSPGC information table 
VTSPGCIT , and the program chain information 2s 
VTS_PGCI#i controlling the VOB data reproduction 
sequence shown in Fig. 5 are produced, and formatting 
to, for example, interleave the VOB contained in the 
multi-scene periods, is accomplished. The specific 
steps executed in this formatting operation are 30 
described below with reference to Fig. 27, Fig. 28, Fig. 
29, Fig. 30, and Fig. 31. 

The encoding parameter generation subroutine 
shown as step #1800 in Fig. 21 is described next with 
reference to Fig. 22, Fig. 23, and Fig. 24, using by way 35 
of example an operation for generating the encoding 
parameters for multi-angle control. 

Starting from Fig. 22, the process for generating the 
encoding parameters of a non-seamless switching 
stream with multi-angle control is described first. This 40 
stream is generated when step #1500 in Fig. 21 returns 
NO and the following flags are set as shown: VOB^Fsb 
= 1 or VOB_Fsf = 1 , VOB_Fp = 1 , VOB_Fi = 1 , VOB_Fm 
= 1, and VOB_FsV = 0. The following operation pro- 
duces the encoding information tables shown in Fig. 13 45 
and Fig. 14, and the encoding parameters shown in Fig. 
15. 

At step #1812, the scenario reproduction sequence 
contained in the scenario data St7 is extracted, the VOB 
Set number VOBSJsJO is set, and the VOB number so 
VOB_NO is set for one or more VOB in the VOB Set. 

At step #181 4 the maximum bit rate ILV_BR of the 
interleaved VOB is extracted from the scenario data 
St 7, and the maximum video encoding bit rate 
V_MRATE from the encoding parameters is set based ss 
on the interleave flag VOB_Fi setting (= 1). 

At step #1816, the minimum interleaved unit pres- 
entation time ILVUJ/IT is extracted from the scenario 



data St7. 

At step #1818, the video encoding GOP structure 
GOPST values N = 15 and M = 3 are set, and the GOP 
structure fixing flag GOPFXflag is set (= 1), based on 
the multi-angle flag VOB_Fp setting (= 1). 

Step #1820 is the common VOB data setting rou- 
tine. 

The common VOB data setting routine in step 
#1820 is described below referring to the flow chart in 
Fig. 23. This common VOB data setting routine pro- 
duces the encoding information tables shown in Fig. 13 
and Fig. 14, and the encoding parameters shown in Rg. 
15. 

At step #1822 the video material start time 
VOB_VST and video material end time VOB_VEND are 
extracted for each VOB from the scenario data St7, and 
the video encoding start time V_STTM and video 
encoding end time V_ENDTM are used as video encod- 
ing parameters. 

At step #1824 the audio material start time 
VOB_AST of each VOB is extracted from the scenario 
data St7, and the audio encoding start lime A_STTM is 
set as an audio encoding parameter. 

At step #1826 the audio material end time 
VOB_AEND is extracted for each VOB from the sce- 
nario data St7, and the audio encoding end time 
A_ENDTM, which is an audio encoding parameter, is 
set to a time of an audio access unit (AAU) not exceed- 
ing the VOB_AEND time. Note that the audio access - 
unit AAU is determined by the audio encoding method. 

At step #1828 the audio start gap A_STGAP 
obtained from the difference between the video encod- 
ing start time V_STTM and the audio encoding start 
time A_STTM is defined as a system encoding parame- 
ter. 

At step #1830 the audio end gap A_ENDGAP 
obtained from the difference between the video encod- 
ing end time VJENDTM and the audio encoding end 
time A_ENDTM is defined as a system encoding 
parameter. 

At step #1832 the video encoding bit rate V_BR is 
extracted from the scenario data St7, and the video 
encoding bit rate V_RATE, which is the average bit rate 
of video encoding, is set as a video encoding parame- 
ter. 

At step #1834 the audio encoding bit rate A_BR is 
extracted from the scenario data St7, and the audio 
encoding bit rate A_RATE is set as an audio encoding 
parameter. 

At step #1836 the video material type 
VOB_VJ<IND is extracted from the scenario data St7. If 
the material is film, that is, a telecine converted material, 
reverse telecine conversion is set for the video encoding 
mode V_ENCMD, and defined as a video encoding 
parameter. 

At step #1838 the audio encoding method 
VOB_A_KIND is extracted from the scenario data St7, 
and the encoding method is set as the audio encoding 
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mode AJENCMD and set as an audio encoding param- 
eter. 

At step #1840 the initial value of the initial video 
encocfing data VJNTST set in the VBV buffer is set to a 
value less than the VBV buffer end value of the last s 
video encoding data V_ENDST, and is defined as a 
video encoding parameter. 

At step #1842 the VOB number VOB_NO of the 
preceding connection is set to the preceding VOB 
number B_VOB_NO based on the setting (= 1) of the 10 
preceding VOB seamless connection flag VOB_Fsb 
and set as a system encoding parameter. 

At step #1344 the VOB number VOBJMO of the fol- 
lowing connection is set to the following VOB number 
F_VOB_NO based on the setting (= 1) of the following 15 
VOB seamless connection flag VOB_Fsf, and set as a 
system encoding parameter. 

The encoding information table and encoding 
parameters are thus generated for a multi-angle VOB 
Set with non-seamless multi-angle switching control 20 
enabled. 

The process for generating the encode parameters 
of a seamless switching stream with multi-angle control 
is described below with reference to Fig. 24. This 
stream is generated when step #1 500 in Fig. 21 returns 25 
YES and the following flags are set as shown: VOB_Fsb 
= 1 or VOB_Fsf = 1 , VOB_Fp = 1 , VOB_Fi = 1 , VOB_Fm 
= 1,andVOB_FsV = 1. 

The following operation produces the encoding 
information tables shown in Rg. 1 3 and Fig. 1 4, and the 30 
encode parameters shown in Fig. 15. 

At step #1850, the scenario reproduction sequence 
contained in the scenario data St7 is extracted, the VOB 
Set number VOBS_NO is set, and the VOB number 
VOBJMO is set for one or more VOB in the VOB Set. 35 

At step #1852 the maximum bit rate ILV_BR of the 
interleaved VOB is extracted from the scenario data 
St7, and is set to the maximum video encoding bit rate 
V_RATE based on the interleave flag VOB_R setting (= 
1). 40 

At step #1854, the minimum interleaved unit pres- 
entation time ILVU_MT is extracted from the scenario 
data St7. 

At step #1856, the video encoding GOP structure 
GOPST values N = 15 and M = 3 are set, and the GOP 45 
structure fixing flag GOPFXflag is set (= 1), based on 
the multi-angle flag VOB_Fp setting (= 1). 

At step #1858, the video encoding GOP structure 
GOPST is set to "closed GOP" based on the seamless 
switching flag VOB_FsV setting (= 1), and the video so 
encoding parameters are thus defined, 

Step #1860 is the common VOB data setting rou- 
tine. This common routine is the routine shown in Fig. 
22. and has already been described. Further descrip- 
tion thereof is thus omitted here. ss 

The encode parameters of a seamless switching 
stream with multi-angle control are thus defined for a 
VOB Set with multi-angle control as described above. 



The process for generaling the encode parameters 
for a system stream in which parental lock control is 
implemented is described below with reference to Rg. 
25. This stream is generated when step #1 200 in Rg. 20 
returns NO and step #1304 returns YES, that is, the fol- 
lowing flags are set as shown: VOB_Fsb = 1 or 
VOB_Fsf = 1 , VOB_Fp = 1 , VOB_R = 1 , VOB_Fm = 0. 

The following operation generates the encoding 
information tables shown in Rg. 13 and Rg. 14, and the 
encode parameters shown in Rg. 15. 

At step #1870, the scenario reproduction sequence 
contained in the scenario data St7 is extracted, the VOB 
Set number VOBS_NO is set. and the VOB number 
VOB_NO is set for one or more VOB in the VOB Set 

At step #1872 the maximum bit rate ILV_BR of the 
interleaved VOB is extracted from the scenario data 
St7, and the maximum video encoding bit rate V_RATE 
is set based on the interleave flag VOB_R setting (= 1). 

At step #1874 the number of interleaved VOB divi- 
sions ILV_DIV is extracted from the scenario data St7. 

Step #1876 is the common VOB data setting rou- 
tine. This common routine is the routine shown in Rg. 
22, and has already been described. Further descrip- 
tion thereof is thus omitted here. 

The encode parameters of a system stream in 
which parental lock control is implemented are thus 
generated for a VOB Set with multi-scene selection con- 
trol enabled as described above. 

The process for generating the encode parameters 
for a system stream containing a single scene is 
described below with reference to Fig. 26. This stream 
is generated when step #900 in Rg. 20 returns NO, that 
is, when VOB_Fp = 0. The following operation produces 
the encoding information tables shown in Rg. 13 and 
Rg. 14, and the encode parameters shown in Rg. 15. 

At step #1880, the scenario reproduction sequence 
contained in the scenario data St 7 is extracted, the VOB 
Set number VOBS_.NO is set, and the VOB number 
VOB_NO is set for one or more VOB in the VOB Set 

At step #1882 the maximum bit rate ILV_BR of the 
interleaved VOB is extracted from the scenario data 
St7, and the maximum video encoding bit rate 
VJvlRATE is set based on the interleave flag VOB_R 
setting (= 1). 

Step #1884 is the common VOB data setting rou- 
tine. This common routine is the routine shown in Rg. 
22, and has already been described. Further descrip- 
tion thereof is thus omitted here. 

The encoding parameters for DVD video, audio, 
and system stream encoding, and the DVD formatter, 
are thus generated as a result of generating the encod- 
ing information table and encode parameters by means 
of the flow charts described above. 

( 3.6.2 )Fprmatter flow chart Q^criptions 

The operation of the formatter subroutine for gener- 
ating a DVD multimedia stream shown as step #2300 in 



28 



55 



EP0 875 856A1 



56 



Fig. 21 is described next with reference to Fig. 27, Fig. 
28, Fig. 29, Fig. 30, and Fig. 31. 

The operation of the DVD encoder ECD 1100 
according to the present invention is described with ref- 
erence to the flow chart in Fig. 27. Note that those steps 
shown with a double line are subroutines. 

At step #2310 the program chain information 
VTSLPGCI is set to the video title set management 
table VTSLMAT in the VTSI for the number of titles 
TITLEJMUM based on the number of titles TI7l£_NUM 
in the VOB Set data stream. 

At step #2312 it is determined whether multi-scene 
selection control is enabled based on the multi-scene 
flag VOB_Fp in the VOB Set data stream. If step #21 1 2 
returns NO, that is, multi-scene control is not enabled, 
the procedure moves to step #21 14. 

Step #2314 represents a subroutine of the opera- 
tion of formatter 1100 of the authoring encoder EC 
shown in Fig. 12 for encoding a single VOB. This sub- 
routine is described later. 

If step #2312 returns YES, that is, multi-scene con- 
trol is enabled, the procedure moves to step #2316. 

At step #231 6 it is determined whether the informa- 
tion is to be interleaved or not based on the interleave 
flag VOB_Fi in the VOB Set data stream. If step #2316 
returns NO, that is, the information is not to be inter- 
leaved, the procedure moves to step #2314. 

At step #231 8 it is determined whether multi-angle 
control is to be implemented based on the multi-angle 
flag VOB_Fm in the VOB Set data stream. If step #2318 
returns NO, that is, multi-angle control is not enabled, 
the parental lock control routine in step #2320 is exe- 
cuted. 

At step #2320 the operation for formatting the VOB 
Set for parental lock control is executed. This subroutine 
is shown in Fig. 30 and described below. 

If step #2320 returns YES, that is, multi-angle con- 
trol is enabled, the procedure advances to step #2322. 

At step #2322 it is determined whether seamless 
switching is required based on the multi-angle seamless 
switching flag VOB_FsV. If multi-angle switching is 
accomplished without seamless switching, that is, with 
non-seamless switching and step #2322 returns NO, 
the procedure moves to step #2326. 

The multi-angle non-seamless switching control 
routine executed in step #2326 by the formatter 1 100 of 
the authoring encoder EC in Fig. 12 is described later 
with reference to Fig. 28. 

If multi-angle switching is accomplished with seam- 
less switching control, that is, step #2322 returns YES, 
the procedure moves to step #2324. 

The multi-angle seamless switching control routine 
executed in step #2324 by the formatter 1100 is 
described later with reference to Fig. 29. 

The cell playback information CPBI set in a previ- 
ous routine is recorded as the CPBI information of the 
VTSI in step #2328. 

At step #2330 it is determined whether all VOB Sets 



declared by the VOB Set number VOBSJMUM have 
been processed by the formaher. If step #2130 returns 
NO, that is, all VOB sets have not been processed, con- 
trol proceeds to step #2112. 
5 If step #2130 returns YES, that is, all sets have 
been formatted, the procedure terminates. 

Referring to Fig. 28, the multi-angle non-seamless 
switching control routine executed in step #2326 when 
step #2322, Fig. 27, returns NO is described. This rou- 
te tine defines the interleaved arrangement of the multime- 
dia bitstream, the content of the cell playback 
information (C_PBI#i) shown in Fig. 5, and the informa- 
tion stored to the navigation pack NV shown in Fig. 6, in 
the generated DVD multimedia bitstream. 
is At step #2340 based on the multi-angle flag 
VOB_Fm setting (=1) declaring whether multi-angle 
control is applied in the multi-scene period, the cell 
block mode CBM (Fig. 5) of the cell (C_PBI #i in Fig. 5) 
containing the VOB control information for each scene 
20 is recorded using such values as 01b to indicate the 
beginning of the cell block for the cell block mode CBM 
of the MA1 ceil (Fig. 10), 10b to indicate a cell between 
the first and last cells in the block for the CBM of MA2, 
and 1 1 b to indicate the end of the cell block for the CBM 
25 of MA3. 

At step #2342 based on the multi-angle flag 
VOB_Fm setting (=1) declaring whether multi-angle 
control is applied in the multi-scene period, the cell 
block type CBT (Fig. 5) of the cell playback information 

30 blocks C_PBI #i containing the VOB control information 
for each scene is declared as 01b to indicate an "angle." 

At step #2344 the seamless playbackf lag SPF (Fig. 
5) is set to 1 in the cell playback information blocks 
C_PBI #i containing the VOB control information for 

35 each scene based on VOB_Fsb, which indicates 
whether the connection is seamless and is set to 1 . 

At step #2346 the STC discontinuity flag STCDF 
(Fig. 5) is set to 1 in the cell playback information blocks 
C_PBI #i (Fig. 5) containing the VOB control information 

40 for each scene based on VOB_Fsb which indicates 
whether the connection is seamless and is set to 1 . 

At step #2348 the interleaved allocation flag IAF 
(Fig. 5) is set to 1 in the cell playback information blocks 
C_PBl #i (Fig. 5) containing the VOB control information 

45 for each scene based on VOB_FsV, which is set to 1 to 
indicate interleaving is required. 

At step #2350 the location of the navigation pack 
NV (relative sector number from the VOB beginning) is 
detected from the title editing unit (VOB below) obtained 

so from the system encoder 900 in Fig. 12, the navigation 
pack NV is detected based on the minimum interleaved 
unit presentation time ILVU_MT information (a formatter 
parameter obtained in step #1816, Fig. 21), the location 
of the VOBU expressed, for example, as the number of 

55 sectors from the VOB beginning is thus obtained, and 
the title editing unit VOB is divided into interleave units 
using VOBU units. 

For example, in this example the minimum inter- 



29 



57 



EP0875 856A1 



58 



leaved unit presentation time ILVU_MT is 2 sec and the 
presentation time of one VOBU is 0.5 sec., and the VOB 
is therefore divided into interleave units of 4 VOBU 
each. Note that this allocation operation is applied to the 
VOB constituting each mutti-scene data unit. 

At step #2352 the interleave units of each VOB 
obtained from step #2350 are arranged in the ceil block 
mode CBM sequence (cell block beginning, middle, and 
end cells) written as the VOB control information for 
each scene in step #2340 to form the interleaved blocks 
as shown in Fig. 18 or Fig. 34. The interleaved blocks 
are then added to the VTS title VOBS (VTSTT_VOBS). 
Using the cell block mode CBM declarations above, for 
example, the angle data MA1, MA2, and MA3 (Fig. 10) 
are arranged in that sequence. 

At step #2354 the relative sector number from the 
VOBU start is written to the VOB end pack address 
COBU_EA (Fig. 8) in the navigation pack NV of each 
VOBU based on the VOBU position information 
obtained in step #2350. 

At step #2356 the first cell VOBU start address 
C_FVOBU_SA and the last cell VOBU start address 
C_LVOBU_SA expressed as the number of sectors 
from the beginning of the VTS title VOBS 
(VTSTTJ/OBS) are written as the addresses of the 
navigation packs NV of the first and last VOBU in each 
cell based on the VTS title VOBS (VTSTT_VOBS) data 
obtained in step #2352. 

The angle #i VOBU start address 
NSM L_AGL_C1 _DSTA to NSML_AGL_C9_DSTA of 
the non-seamless angle information NSML_AGU (Fig. 
8) in the navigation pack NV of each VOBU is written at 
step #2358. This address is expressed as the relative 
sector number inside the data of the interleaved blocks 
formed in step #2352, and declares the address infor- 
mation of the navigation pack NV contained in the 
VOBU of all angle scenes near the presentation start 
time of the VOBU being processed. 

At step #2360 TFFFFFFFh" is written to the angle 
#i VOBU start address N SM L_AG L_C 1 _DSTA to 
NSM L_AGL_C9_DSTA (Fig. 8) of the non-seamless 
angle information NSML_AGLI (Fig. 8) in the navigation 
pack NV of each VOBU if the VOBU being processed is 
the last VOBU of each scene in the mutti-scene period. 

This routine thus formats the interleaved blocks for 
multi-angle non-seamless switching control in the multi- 
scene period, and formats the cell control information as 
the reproduction control information for those multiple 
scenes. 

Referring to Fig. 29, the multi-angle seamless 
switching control routine executed in step #2324 when 
step #2322, Fig. 27, returns YES is described. This rou- 
tine defines the interleaved arrangement of the multime- 
dia bitstream, the content of the cell playback 
information (C_PBI#i) shown in Fig. 5, and the informa- 
tion stored to the navigation pack NV shown in Fig. 8, in 
the generated DVD multimedia bitstream. 

At step #2370 based on the multiangle flag 



VOB_Fm setting (=1) declaring whether multi-angle 
control is applied in the multi-scene period, the cell 
block mode CBM (Fig. 5) of the cell playback informa- 
tion blocks C_PBI $ (Fig. 5) containing the VOB control 

5 information for each scene is declared according to the 
position of the angle data. For example, the cell block 
mode CBM of the MA1 cell (Fig. 10) is declared as 01b 
to indicate the beginning of the cell block, the CBM of 
MA2 is declared as 10b to indicate a cell between the 

;o first and last cells in the block, and the CBM of MA3 is 
declared as 1 1b to indicate the end of the cell block. 

At step #2372 based on the multi-angle flag 
VOB_Fm setting (=1) declaring whether multi-angle 
control is applied in the multi-scene period, the cell 

is block type CBT (Fig. 5) of the cell playback information 
blocks C_PBI #i (Fig. 5) containing the VOB control 
information for each scene is declared as 01b to indi- 
cate an "angle." 

At step #2374 the seamless playback flag SPF (Fig. 

20 5) is set to 1 in the cell playback information blocks 
C_PBI #i (Fig. 5) containing the VOB control information 
for each scene based on VOB__Fsb, which indicates 
whether the connection is seamless and is set to 1 . 
At step #2376 the STC discontinuity flag STCDF 

25 (Fig. 5) is set to 1 in the cell playback information blocks 
C_PBI #i (Fig. 5) containing the VOB control information 
for each scene based on VOB_Fsb, which indicates 
whether the connection is seamless and is set to 1 . 
At step #2378 the interleaved allocation flag IAF 

30 (Fig. 5) is set to 1 in the cell playback information blocks 
C_PBI #i (Fig. 5) containing the VOB control information 
for each scene based on VOB_FsV, which is set to 1 to 
indicate interleaving is required. 

At step #2380 the location of the navigation pack 

35 NV (relative sector number from the VOB beginning) is 
detected from the title editing unit (VOB below) obtained 
from the system encoder 900 in Fig. 1 2, the navigation 
pack NV is detected based on the minimum interleaved 
unit presentation time ILVU_MT information (a formatter 

40 parameter obtained in step #1 854, Fig. 24), the location 
of the VOBU (expressed as the number of sectors from 
the VOB beginning, for example) is thus obtained, and 
the title editing unit VOB is divided into interleave units 
using VOBU units. 

45 For example, the minimum interleaved unit presen- 
tation time is 2 sec and the presentation time of one 
VOBU is 0.5 sec. in the previous example, and a VOB is 
therefore divided into interleave units of 4 VOBU each. 
Note that this allocation operation is applied to the VOB 

so constituting each multi-scene data unit 

At step #2382, the interleave units of each VOB 
obtained from step #1 852 are arranged according to the 
cell block mode CBM (Fig. 5) sequence (ceil block 
beginning, middle, and end cells) as the VOB control 

55 information for each scene recorded in step #2160 to 
form the interleaved blocks as shown in Fig. 18 or Fig. 
34. The interleaved blocks are then added to the 
VTSTT_VOBS. Using the cell block mode CBM declara- 
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tions above, for example, MA1 , MA2, and MA3 (Fig. 10) 
are arranged in that sequence. 

At step #2384 the relative sector number from the 
VOBU start is written to the VOB end pack address 
COBU_EA (Fig. 8) in the navigation pack NV of each s 
VOBU based on the VOBU position information 
obtained in step #2360. 

At step #2386 the first cell VOBU start address 
C_FVOBU_SA and the last cell VOBU start address 
C_LVOBU_SA expressed as the number of sectors 10 
from the beginning of the VTS title VOBS 
(VTSTTVOBS) are written as the addresses of the 
navigation packs NV of the first and last VOBU in each 
cell based on the VTS title VOBS (VTSTT_VOBS) data 
obtained in step #2382. is 

At step #2388 the relative sector number to the last 
pack in the interleave unit is written to the ILVU end pack 
address ILVU_EA (Fig. 8) in the navigation pack NV of 
each VOBU in an interleave unit based on the interleave 
unit data obtained in step #2370. 20 

The angle #i VOBU start address 
SML_AGL_C1_DSTA to SML_AGL_C9_DSTA of the 
seamless angle information SML_AGU (Fig. 8) in the 
navigation pack NV of each VOBU is written at step 
#2390. This address is expressed as the relative sector 25 
number inside the data of the interleaved blocks formed 
in step #2382, and declares the address information of 
the navigation pack NV contained in the_ VOBU of all 
angle scenes with a start time contiguous to the repro- 
duction end time of the VOBU being processed. 30 

At step #2392 VFFFFFFFh" is written to the angle 
#i VOBU start address SML_AGL_C1_DSTA to 
SML_AGL_C9_DSTA (Fig. 8) of the seamless angle 
information SML_AGLI (Fig. 8) in the navigation pack 
NV of the VOBU contained in the interleaved unit if the 35 
interleave unit arranged in step #2382 is the last inter- 
leave unit of each scene in the multi-scene period. 

This routine thus formats the interleaved blocks for 
multi-angle seamless switching control in a multi-scene 
period, and formats the cell control information as the *o 
reproduction control information for those multiple 
scenes. 

The parental lock control subroutine (step #2320) 
executed when step #2318 in Fig. 27 returns NO, that is, 
when it is determined that parental lock control is imple- 45 
mented and not multi-angle control, is described next 
with reference to Fig. 30. 

The parental lock subroutine described below 
writes the interleave unit arrangement of the multimedia 
bitstream, the content of the cell playback information so 
(C_PBI #i in Fig. 5), and the navigation pack NV infor- 
mation shown in Fig. 8, to the generated DVD multime- 
dia bitstream. 

At step #2402 M 00b" is written to the cell block mode 
CBM (Fig. 5) of the cell playback information C_PBI #i ss 
(Fig. 5) containing the VOB control information for each 
scene based on VOB_Fm, which is set to 0 to indicate 
that multi-angle control is not enabled in the multi-scene 
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period. 

At step #2404 the seamless playbackflag SPF (Fig. 
5) is set to 1 in the cell playback information C_PBI #i 
(Fig. 5) containing the VOB control information for each 
scene based on VOB_Fsb. which indicates whether the 
connection is seamless and is set to 1 . 

At step #2406 the STC discontinuity flag STCDF is 
set to 1 in the cell playback information C_PBI #i (Fig. 5) 
containing the VOB control information for each scene 
based on VOB_Fsb, which indicates whether the con- 
nection is seamless and is set to 1 . 

At step #2408 the interleaved allocation flag IAF 
(Fig. 5) is set to 1 in the cell playback information C_PBI 
#i (Fig. 5) containing the VOB control information for 
each scene based on VOB_FsV, which is set to 1 to indi- 
cate interleaving is required. 

At step #2410 the navigation pack NV position infor- 
mation (the relative sector number from the VOB start) 
is detected from the title editing unit (VOB) obtained 
from the system encoder 900 (Fig. 12). The navigation 
pack NV is then detected based on the number of inter- 
leaved VOB divisions ILVJDIV a formatter parameter 
obtained in step #1874 in Fig. 25, to obtain the VOBU 
position information (number of sectors from the VOB 
start), and divide each VOB into the specified number of 
interleave units in VOBU units. 

At step #2412 the interleave units obtained in step 
#2410 are then interleaved. For example, the interleave 
units are arranged in ascending VOB number sequence 
to create the interleaved blocks as shown in Fig. 18 or 
Fig. 34, and the interleaved blocks are added to 
VTSTT_VOBS. 

At step #2414 the relative sector number from the 
VOBU start is written to the VOB end pack address 
COBU_EA (Fig. 8) in the navigation pack NV of each 
VOBU based on the VOBU position information 
obtained in step #2186. 

At step #2416 the first cell VOBU start address 
C_FVOBU_SA and the last cell VOBU start address 
C_LVOBU_SA expressed as the number of sectors 
from the beginning of the VTS title VOBS (VTSTT- 
VOBS) are written as the addresses of the navigation 
packs NV of the first and last VOBU in each cell based 
on the VTS title VOBS (VTSTT_VOBS) data obtained in 
step #2412. 

At step #2418 the relative sector number to the last 
interleave unit pack is written to the ILVU end pack 
address ILVU_EA (Fig. 8) in the navigation pack NV of 
the VOBU forming the interleaved units based on the 
interleaved unit data obtained from step #2412. 

At step #2420, the relative sector number in the 
interleaved block data formed in step #2412 is written to 
the next-ILVU start address NT_ILVU_SA as the loca- 
tion information of the next ILVU in the navigation packs 
NV of the VOBU contained in the interleaved unit ILVU. 

At step #2422 the interleaved unit flag ILVUflag is 
set to 1 in the navigation packs NV of the VOBU con- 
tained in the interleaved unit ILVU. 



EP 0 875 856 A1 



31 



61 



EP0 875 856 A1 



62 



At step #2424, the Unit END flag UnitENDflag of the 
navigation pack NV in the last VOBU of the interleaved 
unitlLVU is set to 1. 

At step #2426 "FFFFFFFFh" is written to the next- 
ILVU start address NTJLVUSA of the navigation pack 
NV of the VOBU in the last interleaved unit ILVU of each 
VOB. 

The operation described above thus formats the 
interleaved blocks to enable parental lock control in 
multi-scene periods, and formats the control information 
in the cells, that is. the cell playback control information 
for the multi-scene periods. 

The subroutine executed as step #23 1 4 when steps 
#2312 and #2316 in Fig. 27 return NO, that is. when the 
scene is determined to be a single scene and not a 
multi-scene period, is described next with reference to 
Fig. 31. 

The single scene subroutine described below 
writes the interleave unit arrangement of the multimedia 
bitstream, the content of the cell playback information 
C_PBI #i shown in Fig. 5, and the navigation pack NV 
information shown in Fig. 8, to the generated DVD mul- 
timedia bitstream. 

At step #2430 a value "00b" indicating a "non-cell 
block" is written to the cell block mode CBM (Fig. 5) of 
the cell playback information C_PBI #i containing the 
VOB control information for each scene based on 
VOB_Fp, which is set to 0 to indicate that the scene is a 
single scene and not part of a multiscene period. 

At step #2432 the interleaved allocation flag IAF 
(Fig. 5) is set to 0 in the cell playback information C_PBI 
#i containing the VOB control information for each 
scene based on VOB_FsV, which is set to 0 to indicate 
interleaving is not required. 

At step #2434 the navigation pack NV position infor- 
mation (the relative sector number from the VOB start) 
is detected from the title editing unit (VOB) obtained 
from the system encoder 900 (Fig. 12). placed in the 
VOBU unit, and added to the VTSTTVOBS, the video 
and other stream data of the multimedia bitstream. 

At step #2436 the relative sector number from the 
VOBU start is written to the VOB end pack address 
COBU_EA (Fig. 8) in the navigation pack NV of each 
VOBU based on the VOBU position information 
obtained in step #2434. 

At step #2438, the address of the navigation pack 
of the first VOBU in each cell, and the address of the 
navigation pack NV of the last VOBU in each cell are 
extracted. The number of sectors from the beginning of 
the VTSTTVOBS is recorded as the first cell VOBU 
start address C_FVOBU_SA, and the number of sec- 
tors from the end of the VTSTT_VOBS is recorded as 
the last cell VOBU start address C_LVOBU_SA. 

At step #2440 the state determined as a result of 
step #300 or step #600 in Fig. 20. that is, whether 
VOB_Fsb is set to 1 indicating a seamless connection 
to the preceding or following scenes, is evaluated. H 
step #2440 returns YES. the procedure moves to step 



#2442. 

At step #2442 the seamless playbackf lag SPF (Fig. 
5) is set to 1 in the cell playback information C_PBI #i 
(Fig. 5) containing the VOB control information for each 

5 scene based on VOB_Fsb, which indicates whether the 
connection is seamless and is set to 1 . 

At step #2444 the STC discontinuity flag STCDF is 
set to 1 in the cell playback information C_PBI #i (Fig. 5) 
containing the VOB control information for each scene 

io based on VOB_Fsb, which indicates whether the con- 
nection is seamless and is set to 1 . 

If step #2440 returns NO, that is, there is not a 
seamless connection to the preceding scene, the proce- 
dure moves to step #2446. 

15 At step #2446 the seamless playback flag SPF (Fig. 
5) is set to 0 in the cell playback information C.PBI #i 
(Hg. 5) containing the VOB control information for each 
scene based on VOB_Fsb, which indicates whether the 
connection is seamless and is set to 0. 

20 At step #2448 the STC discontinuity flag STCDF is 
set to 0 in the cell playback information C_PBI #i (Fig. 5) 
containing the VOB control information for each scene 
based on VOB_Fsb, which indicates whether the con- 
nection is seamless and is set to 0. 

25 The operation described above thus formats a mul- 
timedia bitstream for a single scene period, and records 
the cell playback information C_PBI #i (Fig. 5), and the 
information in the navigation pack NV (Fig. 8), to the 
generated DVD multimedia bitstream. 

so An authoring encoder EC according to the present 
invention is described in further detail below with refer- 
ence to an authoring encoder whereby alternative 
reproduction are possible in parental lock control and 
multi-angle control. Authoring encoding enabling alter- 

35 native reproduction can be broadly divided into the fol- 
lowing six processes. 

Process 1 : elementary encoding parameter gener- 
ation (EEParam) 

40 

* Enter parameters for source material (video 
information, audio information, subpicture 
information) 

* Specify material that will constitute angle peri- 

45 OdS. 

* Specify material that will constitute parental 
lock control periods. 

Process 2: Check EEParam and provide feedback 

50 

Angle period verification 

* Check uniformity of angle period materials. 
Confirm that the length of the video is 
55 equal. 

Confirm an equal number of audio and 
subpicture channels. 

Display an error if uniformity cannot be ver- 
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ified. 

Parental lock control period verification 

* Verify that the M value satisfies equation 1 
and equation 4. 

If the value of M is not acceptable, display 
an error. 

Process 3: Elementary stream encoding 

* Encode the source streams based on the EEP- 
aram values. 

Process 4: System stream encoding 

Process 5: Formatting 

Process 6: Produce disk. 

Note that processes 4, 5, and 6 are basically identi- 
cal to steps #1800 to #2300 described above with refer- 
ence to Rg. 20 and Fig. 21, and further description 
thereof is thus omitted below. 

An editing information generator 100R for generat- 
ing editing control command data St7 is described here 
with reference to Fig. 45. An editing information genera- 
tor 100R in the present example is similar in construc- 
tion to an editing information generator 100 shown in 
Fig. 37 and Fig. 38, and comprises a computer and con- 
trol software therefor, a video tape drive device, and a 
tape drive device. Referring to Fig. 45, 102R is an edit- 
ing information input means, 104R is an error informa- 
tion presentation means, 108R is an editing information 
error detector, 110R is an elementary stream encoding 
parameter generator, and 11 2R is an elementary 
stream input buffer. 

The editing information input means 102R is prefer- 
ably a text editor written in software for receiving infor- 
mation input by a user using a computer keyboard or 
other input device, and storing said information in com- 
puter memory (not shown in the figure). The stored data 
is a video table VTBL (VTBL: Video Table) as shown in 
Rg. 43, and an audio table ATBL (ATBL: Audio Table) as 
shown in Fig. 44. 

The editing information error detector 108R is pref- 
erably a software construction for verifying the content 
of the generated VTBL and ATBL, outputting an error 
value to the error information presentation means 104R 
if an inappropriate setting is detected, and outputting 
the VTBL and ATBL to the elementary stream encoding 
parameter generator 1 1 0R if there are no inappropriate 
settings. It should be noted that the operation of the 
editing information generator 100R is described further 
below in detail with reference to Fig. 46. 

The error information presentation means 104R is 
preferably a software construction for analyzing error 
values input from the editing information error detector 



108R, and providing visual feedback to he user. Specif- 
ically, based on information contained in an error value, 
it is determined whether a parameter in VTBL or ATBL 
is inappropriate, and presents on the display device of 

5 the computer a warning message to the effect that a 
parameter is inappropriate. 

The elementary stream input buffer 112R receives 
video material and audio material input from an external 
source. Typically, this is a video tape drive for receiving 

w video material, and a tape drive for receiving audio 
material and subpicture material. 

The elementary stream encoding parameter gener- 
ator 1 1 0R is preferably a software construction for ana- 
lyzing the VTBL and ATBL, and outputting parameters 

15 to the elementary stream input buffer 1 1 2 and external 
video encoder and audio encoder. 

Based on the VTBL shown in Fig. 43, for example, 
time code St6 is output to the elementary stream input 
buffer 112R, and the remaining information is output to 

20 the encoding system controller 200 as the editing con- 
trol command data St7R. 

Based on the ATBL shown in Rg. 44, for example, 
time code St6 is output to the elementary stream input 
buffer 112R, and the remaining information is output to 

25 the encoding system controller 200 as the editing con- 
trol command data St7R. 

The above processes are described in an actual 
example with reference to Fig. 42, Fig. 43, Fig. 44, Fig. 
45, and Fig. 46. 

30 Rg. 42 is an example of the video data for a pro- 
duced disk, and the reproduction sequence therefor. 

Referring to Fig. 42, period B is a parental lock con- 
trol period comprising video data v02 and v02cut. Video 
data v02cut represents video data from which a part of 

35 the video has been cut for the purpose of viewer restric- 
tions. Period C is an angle period comprising video data 
v04en, v04fr, v04es, and v04pt. Video data v04en, 
v04f r, v04es, and v04pt are video data captured at four 
angles. Operation is described in detail below in each 

40 process. 

Rrst, in the first process, parameter EEPararn for 
elementary stream encoding is generated. The editing 
information input means 102R receives user input, and 
generates video table VTBL, a parameter for video 

45 encoding, and audio table ATBL, a parameter for audio 
encoding. 

An example of a video table VTBL for the video 
encoding input data shown in Fig. 42 is shown in Rg. 
43. As shown in the figure, there are eight fields in the 
so video table VTBL shown as, in sequence from the left, 
VOB, Audio, SP, ATTR, 

STARTTC, END_TC, BR, and 132. These are 
defined as: 

55 

* VOB: MPEG stream name for interleaving 

* Audio: number of audio streams in the MPEG 
stream 
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* SP: number of subpicture streams in the 
MPEG stream 

* ATTR: attribute information; angle periods are 
"AG", parental lock control periods are "DC", 
and standard periods are "SL". s 

* STARTJC: start encoding code of the source 
tape 

* ENDJTC: end encoding code of the source 
tape 

* BR: BitRate 10 

* 132: reverse telecine conversion declaration 
flag 

An example of an audio table ATBL for the audio 
encoding input data is shown in Fig. 44. As shown in the 75 
figure, there are also eight fields in the audio table ATBL 
shown as from the left as VOB, STR_NO, MODE, 
STARTJC, ENDJTC, ATTR, BR, and FQ. These are 
defined as: 

20 

* VOB: MPEG stream name for interleaving 

* STR_NO: stream number 

* MODE: encoding mode, either AC3 or LPCM 

* START_TC: start encoding code of the source tape 

* ENDJTC: end encoding code of the source tape 2s 

* ATTR: attribute information; angle periods are "AG", 
parental lock control periods are "DC", and stand- 
ard periods are "SL". 

* BR: BitRate 

* FQ: sampling frequency 30 

Note that the sequence in which the fields are enu- 
merated matches the reproduction sequence. In addi- 
tion, angle and parental lock control periods are in a 
continuously enumerated format The sequence of enu- 35 
meration in an angle period is the angle number 
sequence. . 

Checking the elementary encoding parameter 
EE Pa ram and providing feedback occurs next in the 
second process. The error information presentation <o 
means 104R and editing information error detector 
108R perform verification of the multi-angle control peri- 
ods and parental lock control periods. Verification of the 
multi-angle control periods and parental lock control 
periods is described next below with reference to the 45 
flow chart in Fig. 46. 

First, at step S1, the BitRate is evaluated. In this 
BitRate evaluation, it is determined whether the BitRate 
of each VOB (MPEG stream) is equal to or less than a 
particular BitRate. so 

That is, the overall BitRate of the VOB is deter- 
mined based on: 

BitRate assigned to video information in the VOB: 
SR value of the VTBL ss 

* BitRate assigned to audio information in the VOB: 
SR value of the ATBL 

* BitRate assigned to subpicture information in the 



VOB: constant value 

* number of audio streams in the VOB: Audio value in 
the VTBL 

* number of subpicture streams in the VOB: SP value 
in the VTBL 

and it is determined whether the overall BitRate is 
less than the transfer rate to the video buffer. If the 
BitRate is not appropriate, control advances to step 
S9, and an error process is accomplished to, for 
example, present a warning message on a monitor 
device. 

The period is determined in step S3. First, each 
entry in the video table VTBL is inspected in sequence 
to detect angle periods and parental lock control peri- 
ods. More specifically, entries with an attribute value of 
"AG" are determined to be angles, and the set of contig- 
uous entries with an attribute of "AG" are determined to 
be video data entries forming an angle period. Parental 
lock control periods are determined in the same man- 
ner. When a parental lock control period is determined, 
control advances to step S5; when a multi-angle control 
period is determined, control advances to step S7. 

Verification of the parental lock control period 
occurs in step S5. The entries of each VOB stored as a 
parental lock control period are verified, and it is deter- 
mined whether the M value satisfies equations (1) and 
(4) above. An error process is executed if a value is not 
appropriate. (A warning message is displayed on a 
monitor.) 

Verification of the multi-angle control period occurs 
in step S7. It is determined whether the above-noted 
conditions of an angle period are satisfied, and whether 
the uniformity of each VOB associated with the angle 
period can be assured. An error process is executed if a 
value is not appropriate. (A warning message is dis- 
played on a monitor.) 

In step S9, the video table VTBL and audio table 
ATBL are output from editing information generator 
100R to the elementary stream encoding parameter 
generator 1 1 0R, and control then advances to the next 
(third) process. 

Elementary stream encoding occurs in the third 
process. The materials are encoded based on the ele- 
mentary encoding parameter EEParam by means of the 
elementary stream encoding parameter generator 
1 10R, elementary stream input buffer 1 12R, and encod- 
ing boards, etc. (not shown in the figure). The elemen- 
tary stream encoding parameter generator 110R 
controls the elementary stream input buffer 1 12R (tape 
drive, other) according to the video table VTBL and 
audio table ATBL, outputs the material data to an exter- 
nal encoder, and simultaneously outputs encoding 
parameters to the external encoder. It should be noted 
that in place of the elementary stream encoding param- 
eter generator 110R, the encoding system controller 
200 can be caused to control the elementary stream 
input buffer 11 2R. 
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Processes 4, 5, and 6 are basically identical to 
steps #1 800 to #2300 described above with reference to 
Fig. 20 and Fig. 21, and further description thereof is 
thus omitted below. 

It should be noted that description of process 4, 
process 5, and process 6 is omitted below. 

< 3.7 ) Decoder flow charts 

< 3.7.1 >PisK-tp-strear n b u ff er tra ns fe r 

The decoding information table produced by the 
decoding system controller 2300 based on the scenario 
selection data St51 is described below referring to Fig. 
32 and Fig. 33. The decoding information table com- 
prises the decoding system table shown in Fig. 32, and 
the decoding table shown in Fig. 33. 

As shown in Fig. 32, the decoding system table 
comprises a scenario information register and a ceil 
information register. The scenario information register 
records the title number and other scenario reproduc- 
tion information selected by the user and extracted from 
the scenario selection data St51. The cell information 
register extracts and records the information required to 
reproduce the cells constituting the program chain 
based on the user-defined scenario information 
extracted into the scenario information register. 

More specifically, the scenario information register 
contains plural sub-registers, that is, the angle number 
register ANGLE_NO_reg, VTS number register 
VTS_NO_reg, PGC number register 
VTS_PGCI_NO_reg, audio ID register AUDIOJD_reg, 
subpicture ID register SPJD_reg, and the SCR buffer 
register SCR_buffer. 

The angle number register ANGLE_NO__reg stores 
which angle is reproduced when there are multiple 
angles in the reproduced PGC. 

The VTS number register VTS_NO_reg records the 
number of the next VTS reproduced from among the 
plural VTS on the disk. 

The PGC number register VTS_PGCI_NO_reg 
records which of the plurality* of program chains PGC 
present in the video title set VTS is to be reproduced for 
parental lock control or other application. 

The audio ID register AUDIO_ID_reg records which 
of the plurality of audio streams in the VTS are to be 
reproduced. 

The subpicture ID register SPJD_reg records 
which of the plurality of subpicture streams is to be 
reproduced when there are plural subpicture streams in 
the VTS. 

The SCR buffer SCRJbuffer is the buffer for tempo- 
rarily storing the SCR recorded to the pack header as 
shown in Fig. 7. As described with reference to Fig. 3. 
this temporarily stored system clock reference SCR is 
output to the decoding system controller 2300 as the bit- 
stream control data St63. 

The cell information register contains the following 



sub-registers: the cell block mode register CBM_reg, 
cell block type register CBTjeg, seamless playback 
flag register SPBj-eg, interleaved allocation flag regis- 
ter IAF_reg, STC discontinuity flag register STCDF, 

5 seamless angle change flag register SACF_reg, first 
cell VOBU start address register C_FVOBU_SA_reg, 
and last cell VOBU start address register 
C_LVOBU_SA_reg. 

The cell block mode register CBM_reg stores a 

10 value indicating whether plural cells constitute one func- 
tional block. If there are not plural cells in one functional 
block, CBM_reg stores N_BLOCK. If a plurality of cells 
constitute one functional block, the value F_CELL is 
stored in the first cell in the functional block, L_CELL is 

15 stored in the last cell in the functional block, and BLOCK 
is stored in all cells between the first and last cells in the 
functional block. 

The cell block type register CBTjeg stores a value 
defining the type of the block indicated by the cell block 

20 mode CBM_reg. If the cell block is a multi-angle block, 
A_BLOCK is stored; if not, N_BLOCK is stored. 

The seamless reproduction flag register SPF_reg 
stores a value defining whether that cell is seamlessly 
connected with the cell or cell block reproduced there- 

25 before. If a seamless connection is specified, SML is 
stored; if a seamless connection is not specified, NSML 
is stored. 

The interleaved allocation flag register IAF_reg 
stores a value identifying whether the cell exists in an 

30 interleaved block. If the cell is part of a an interleaved 
block, ILVB is stored; otherwise NJLVB is stored. 

The STC discontinuity flag register STCDF defines 
whether the system time dock STC used for synchroni- 
zation must be reset when the cell is reproduced; when 

35 resetting the system time clock STC is necessary, 
STC_RESET is stored; if resetting is not necessary, 
STC_N RESET is stored. 

The first cell VOBU start address register 
C_FVOBU_SA_reg stores the VOBU start address of 

40 the first cell in a block. The value of this address is 
expressed as the distance from the logic sector of the 
first cell in the VTS title VOBS (VTSTT_VOBS) as 
measured by and recorded as the number of sectors. 
The last cell VOBU start address register 

45 C_LVOBU_SA_reg stores the VOBU start address of 
the last cell in the block. The value of this address is 
expressed as the distance from the logic sector of the 
first cell in the VTS title VOBS (VTSTTVOBS) meas- 
ured by and recorded as the number of sectors. 

so The decoding table shown in Fig. 33 is described 
below. As shown in the figure, the decoding table com- 
prises the following registers: an information register for 
non-seamless multi-angle control, an information regis- 
ter for seamless multi-angle control, a VOBU informa- 

55 tion register, and an information register for seamless 
reproduction. 

The information register for non-seamless multi- 
angle control comprise NSML_AGL_C 1_DSTAjeg to 
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NSM L_AGL_C9_DSTA_reg. 

NSML_AGL_C1_DSTA_reg to 
NSM L_AGL_C9_DSTA_jeg record the 

NMSL_AGL_C1_DSTA to NMSL_AGL_C9_DSTA val- 
ues in the PC1 packet shown in Fig. 8. 

The information register for seamless multi-angle 
control comprises SML_AGL_C 1 _DSTA_jeg to 
SML_AGL_C9_DSTA_reg . 

SML_AGL_C1_DSTA_reg to 
SML_AGL_C9_DSTA_reg record the 

SML_AGL_C1 JDSTA to SML_AGL_C9_DSTA values in 
the DSI packet shown in Fig. 8. 

The VOBU information register stores the end pack 
address VOBU_EA. 

The information register for seamless reproduction 
comprises: an interleaved unit flag register 
ILVUJIag_reg, Unit END flag register 
UNIT_ENDJIag_reg, ILVU End Address register 
ILVU_EA_reg, Next Interleaved Unit Start Address reg- 
ister NTJLVU_SA_reg, the presentation start time of 
the first video frame in the VOB register 
VOB_V_SPTM_jeg, the presentation end time of the 
last video frame in the VOB register 
VOB_V_EPTM__reg 1 audio reproduction stopping time 1 
register VOB_A_GAPJ 5 TM1_reg, audio reproduction 
stopping time 2 register VOB_A_GAP_PTM2_reg, 
audio reproduction stopping period 1 register 
VOB_A_GAP_LEN1 _reg, and audio reproduction stop- 
ping period 2 register VOB_A_GAP_LEN2_reg. 

The interleaved unit flag register ILVU_flag_reg 
stores a value indicating whether the video object unit 
VOBU is in an interleaved block, and stores ILVU if it is, 
and NJLVU if not. 

The Unit END flag register UNIT_ENDJlag_reg 
stores a value indicating whether the video object unit 
VOBU is the last VOBU in the interleaved unit ILVU if the 
VOBU is in an interleaved block. Because the inter- 
leaved unit ILVU is the data unit for continuous reading, 
END is stored if the VOBU currently being read is the 
last VOBU in the interleaved unit ILVU. and N_END is 
otherwise stored. 

The Interleaved Unit End Address register 
ILVU_EA_reg stores the address of the last pack in the 
ILVU to which the VOBU belongs if the VOBU is in an 
interleaved block. This address is expressed as the 
number of sectors from the navigation pack NV of that 
VOBU. 

The Next Interleaved Unit Start Address register 
NT_ILVU_SA_reg stores the start address of the next 
interleaved unit ILVU if the VOBU is in an interleaved 
block. This address is expressed as the number of sec- 
tors from the navigation pack NV of that VOBU. 

The Initial Video Frame Presentation Start Time 
register VOB_V_SPTM_reg stores the time at which 
presentation of the first video frame in the VOB starts. 

The Final Video Frame Presentation Termination 
Time register VOB_V_EPTM_reg stores the time at 
which presentation of the last video frame in the VOB 



ends. 

The audio reproduction stopping time 1 register 
VOB_A_GAP_PTM1_reg stores the time at which the 
audio is to be stopped, and the audio reproduction stop- 
5 ping period 1 register VOB_AjGAP_LEN1_reg stores 
the length of time the audio is stopped. 

The audio reproduction stopping time 2 register 
VOB_A_GAP_PTM2_reg and audio reproduction stop- 
ping period 2 register VO B_A_G AP_LE N2_r eg store 
w the same values. 

<3.8) DVD player 

An example of a reproduction apparatus for a DVD, 

15 which is an optical disk to which is recorded a multime- 
dia bitstream generated by a multimedia optical disk 
authoring system according to the present invention, is 
shown in Fig. 41. A DVD is used stored in a tray of a 
DVD player 1 . The DVD player 1 is connected to a tele- 

20 vision monitor 2 whereby video and audio reproduced 
from the DVD are presented. Note that a remote control 
91 is also provided for the user to operate the DVD 
player 1 and television monitor 2. 

The DVD player 1 has an opening at the front, and 

25 a drive mechanism into which an optical disc is set is 
disposed in the depthwise direction from the opening. A 
remote control receptor 92 comprising an optical recep- 
tor for detecting an infrared beam emitted by the remote 
control 91 is disposed at the front of the DVD player 1. 

30 When a user holds and operates the remote control, the 
remote control receptor 92 emits an interrupt signal indi- 
cating that a key signal has been received. 

A video output terminal and audio output terminal 
are disposed at the back of the DVD player 1 , and by 

35 connecting thereto an AV cord, a video signal repro- 
duced from the DVD can be output to a large-screen tel- 
evision monitor 2 for consumer use. As a result, the user 
can enjoy reproduced video from a DVD on a 33 inch, 
35 inch, or other large television for consumer use. As 

40 will also be known from the above description, a DVD 
player 1 according to the present embodiment is not 
used connected to a personal computer, for example, 
but is used in conjunction with a television monitor 2 as 
a household appliance. 

45 The remote control 91 comprises a keypad with 
resistance springs provided on the case surface thereof, 
and outputs a code corresponding to the depressed key 
by means of an infrared beam. In addition to a POWER 
key, play, stop, pause, fast forward, and reverse keys, 

so the keypad also has an "angle" key for changing the 
angle, and a "setup" key for presenting a setup menu 
whereby parental lock control can be enabled, for exam- 
ple. 

55 Industrial Applicability 

As described above, a multimedia bitstream gener- 
ating method and multimedia optical disc authoring sys- 
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tern according to the present invention are suitable for 
use in an authoring system whereby a title comprising 
bitstreams carrying various information can be edited 
according to user desires to construct a new title, and, 
further, are suitable to a digital video disc system, that 
is, the DVD system, that has been developed in recent 
years. 

Claims 

1. An authoring encoder (EC) in an authoring system 
for generating a bitstream (St35) comprising editing 
units (VOB) contiguously arranged in alternative 
selection periods by segmenting into a specific 
number (M) source streams for carrying moving 
picture data (St1), audio data (St3), and subpicture 
data (St2) information constituting each title having 
a sequence of related content, said authoring 
encoder (EC) comprising: 

means (200, 1 10, and 106) for presenting con- 
tent of said source streams (St1 , St2, and St3) 
in editing units (VOB); 

means (102R) for generating editing instruction 
data (VTBL, ATBL) for the presented editing 
units (VOB); 

verification means (108R) for verifying the edit- 
ing instruction data (VTBL, ATBL), outputting 
the editing data (VTBL, ATBL) when said edit- 
ing instruction data (VTBL, ATBL) is appropri- 
ate, and generating an error signal (St302) 
when said editing instruction data (VTBL, 
ATBL) is not appropriate; and 
means for generating a parameter (St6) for ele- 
mentary encoding said source streams based 
on the output editing instruction data (VTBL, 
ATBL). 

2. The authoring encoder (EC) according to claim 1 , 
wherein the editing instruction data (VTBL, ATBL) 
comprises video management information (VTBL) 
and audio management information (ATBL). 

3. The authoring encoder (EC) according to claim 2, 
further comprising an error information presenta- 
tion means (104) for indicating that either the video 
information (VTBL) or audio information (ATBL) is 
not appropriate based on said error signal (St302). 

4. The authoring encoder (EC) according to claim 2, 
further comprising a period evaluation means (S3) 
for determining whether an alternative reproduction 
period is a parental lock control period or multi- 
angle control period based on the video information 
(VTBL). 

5. The authoring encoder (EC) according to claim 4, 
wherein when said alternative reproduction period 



is a parental lock control period, said verification 
means (108R) determines said period to be appro- 
priate when the specific number (M) is greater than 
a first constant (Mmin) and is less than a second 
5 constant (Mmax), and determines said period to be 
not appropriate when not. 

6. The authoring encoder (EC) according to claim 5, 
wherein said first constant (Mmin) is defined as 

10 Mmin 2s VOBMaxTime/JMT 

where VOBMaxTlme is the longest time of the mov- 
ing picture data (St1) associated with the alterna- 
tive reproduction period, and JMT is the maximum 
jump time of the means for reproducing the bit- 

15 stream, and 

said second constant (Mmax) is defined as 

Mmax £ VOBminTime / (JT + 
(ILUM/BitRate)) 

20 where VOBminTime is the shortest time of the 

moving picture data (St1) associated with the 
alternative reproduction period, JT is the time 
used by the reproduction means for a jump, 
ILUM is the data size of an interleave unit and 

25 BitRate is the transfer rate to the track buffer of 

the reproduction means. 

7. The authoring encoder (EC) according to claim 5, 
wherein when said alternative reproduction period 

30 is a multi-angle control period, said verification 
means (108R) determines said period to be appro- 
priate when all GOP structures in the moving pic- 
ture data (St1) associated with the alternative 
reproduction period are equal, and determines said 

35 period to be not appropriate when not 

8. A bitstream generating method for generating a bit- 
stream (St35) comprising editing units (VOB) con- 
tiguously arranged in alternative selection periods 

40 by segmenting into a specific number (M) source 
streams for carrying moving picture data (St1), 
audio data (St3), and subpicture data (St2) informa- 
tion constituting each title having a sequence of 
related content, said method comprising: 

45 

a step (#100) for presenting content of said 
source streams (St1, St2, and St3) in editing 
units (VOB); 

a means (#200) for generating editing instruc- 
so tion data (VTBL, ATBL) for the presented edit- 

ing units (VOB); 

a verification step (S5, S7) for verifying the edit- 
ing instruction data (VTBL, ATBL), outputting 
the editing instruction data (VTBL, ATBL) when 
55 said editing instruction data (VTBL, ATBL) is 

appropriate, and generating an error signal 
(St302) when said editing instruction data 
(VTBL, ATBL) is not appropriate; and 
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a step (#2200) for generating a parameter (St5) 
for elementary encoding said source streams 
based on the output editing instruction data 
(VTBU ATBL). 

5 

9. The bitstream generating method according to 
claim 8, wherein the editing instruction data (VTBL, 
ATBL) comprises video information (VTBL) and 
audio information (ATBL). 

10 

10. The bitstream generating method according to 
claim 9, further comprising an error information 
presentation step for indicating that either the video 
information (VTBL) or audio information (ATBL) is 

not appropriate based on said error signal (St302). is 

11. The bitstream generating method according to 
claim 9, further comprising a period evaluation step 
(S3) for determining whether an alternative repro- 
duction period is a parental lock control period or 20 
multi-angle control period based on the video infor- 
mation (VTBL). 

12. The bitstream generating method according to 
claim 12, further comprising a step (S5) for deter- 25 
mining, when said alternative reproduction period is 

a parental lock control period, that said period is 
appropriate when the specif ic number (M) is greater 
than a first constant (Mmin) and is less than a sec- 
ond constant (Mmax), and determining said period so 
to be not appropriate when not. 

13. The bitstream generating method according to 
claim 12, wherein said first constant (Mmin) is 
defined as 35 

Mmin £ VOBMaxTime/JMT 
where VOBMaxTime is the longest time of the mov- 
ing picture data (St1) associated with the alterna- 
tive reproduction period, and JMT is the maximum 
jump time of the means for reproducing the bit- <o 
stream, and 

said second constant (Mmax) is defined as 

Mmax <, VOBminTime / (JT + 
(ILUM/BitRate)) 45 
where VOBminTime is the shortest time of the 
moving picture data (St1) associated with the 
alternative reproduction period, JT is the time 
used by the reproduction means for a jump, 
ILUM is the data size of an interleave unit, and so 
BrtRate is the transfer rate to the track buffer of 
the reproduction means. 

14. The bitstream generating method according to 
claim 11, further comprising a step (S7) for deter- 55 
mining, when said alternative reproduction period is 

a mufti-angle control period, that said period is 
appropriate when all GOP structures in the moving 



picture data (St1) associated with the alternative 
reproduction period are equal, and determining 
said period to be not appropriate when not 

15. A manufacturing method for an optical disc medium 
(M) to which is recorded a bitstream (St35) com- 
prising editing units (VOB) contiguously arranged in 
alternative selection periods by segmenting into a 
specific number (M) source streams for carrying 
moving picture data (Stl), audio data (St3). and 
subpicture data (St2) information constituting each 
title having a sequence of related content, said 
method comprising: 

a step (#100) for presenting content of said 
source streams (St1, St2, and St3) in editing 
units (VOB); 

a means (#200) for generating editing instruc- 
tion data (VTBL, ATBL) for the presented edit- 
ing units (VOB); 

a verification step (S5, S7) for verifying the edit- 
ing instruction data (VTBL, ATBL), outputting 
the editing instruction data (VTBL, ATBL) when 
said editing instruction data (VTBU ATBL) is 
appropriate, and generating an error signal 
(St302) when said editing instruction data 
(VTBL, ATBL) is not appropriate; 
a step (#2200) for generating a parameter (St6) 
for elementary encoding said source streams 
based on the output editing instruction data 
(VTBL, ATBL); 

a step (#1800) for generating an encoding 
parameter based on the encoded editing 
instruction data (VTBL, ATBL); 
a step (#2100) for encoding said source data 
based on the generated encoding parameter; 
a step (#2300) for formatting the encoded 
source data according to a recording surface of 
the optical disc medium (M); and 
a step for recording the formatted and encoded 
source data to a recording surface of said opti- 
cal disc medium (M). 

16. The manufacturing method for an optical disc 
medium (M) according to claim 15, further compris- 
ing: 

a period evaluation step (S3) for determining 
whether an alternative reproduction period is a 
multi-angle control period based on the video 
information (VTBL); and 
a step for recording the formatted and encoded 
source data to a recording surface of said opti- 
cal disc medium (M) when said alternative 
reproduction period is a multi-angle control 
period only if all GOP structures in the moving 
picture data (Stl) associated with the alterna- 
tive reproduction period are equal. 
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17. An authoring system for generating a bitstream 
stored to a disc, wherein said bitstream has one or 
more video objects (VOB) containing video informa- 
tion and audio information, and one or more of said 
video objects is a video object (VOB) that is alterna- 5 
tively reproduced from a plurality of video objects 
(VOB), said authoring system comprising: 

a receiving means for receiving encoding con- 
dition information for said video objects, and 10 
selection information for said alternatively 
reproduced video object; 
a storage means for storing said encoding con- 
dition information and selection information; 
a verification means for referencing said selec- is 
tion information and encoding condition infor- 
mation, and verifying whether the encoding 
condition information of the alternatively repro- 
duced video object is appropriate; and 
an encoding means for encoding said video 20 
object according to the encoding condition 
information only when the encoding condition 
information of the alternatively reproduced 
video object is appropriate. 

25 

1 8. The authoring system according to claim 1 7, further 
comprising a feedback means for, when the encod- 
ing condition information of the alternatively repro- 
duced video object is not appropriate, providing 
external feedback indicative of such inappropriate- 30 
ness. 

19. The authoring system according to claim 17, 
wherein alternatively reproduced video objects are 
placed on disc in an interleaved arrangement, 35 

said authoring system further comprising a 
disc reproduction apparatus which comprises a 
track buffer for temporarily storing data read 
from said disc before transfer to a decoder for 40 
video presentation; and 

wherein said verification means verifies 
an interleave period formed by the encoding 
condition information to be inappropriate when 
the track buffer will overflow or the track buffer 45 
will underflow during reproduction. 

20. The authoring system according to claim 19, 
wherein the verification means determines that the 
track buffer will underflow if the number of seg- so 
ments (M) into which an alternatively reproduced 
video object is divided in an interleave period is 
greater than a first constant, and determines that a 
track buffer underflow will occur if said number of 
segments (M) is less than a second constant. ss 
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